From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 25009 invoked by alias); 2 Dec 2017 06:09:17 -0000 Mailing-List: contact newlib-help@sourceware.org; run by ezmlm Precedence: bulk List-Id: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: newlib-owner@sourceware.org Received: (qmail 24999 invoked by uid 89); 2 Dec 2017 06:09:17 -0000 Authentication-Results: sourceware.org; auth=none X-Virus-Found: No X-Spam-SWARE-Status: No, score=-5.0 required=5.0 tests=AWL,BAYES_00,GIT_PATCH_2,KAM_LAZY_DOMAIN_SECURITY,KAM_SHORT,KB_WAM_FROM_NAME_SINGLEWORD,RCVD_IN_DNSWL_NONE autolearn=ham version=3.3.2 spammy=Hx-spam-relays-external:sk:mrelaye, H*RU:sk:mrelaye, H*r:sk:mrelaye, von X-HELO: mout.kundenserver.de Received: from mout.kundenserver.de (HELO mout.kundenserver.de) (212.227.126.135) by sourceware.org (qpsmtpd/0.93/v0.84-503-g423c35a) with ESMTP; Sat, 02 Dec 2017 06:09:15 +0000 Received: from [192.168.178.45] ([95.91.246.210]) by mrelayeu.kundenserver.de (mreue003 [212.227.15.167]) with ESMTPSA (Nemesis) id 0M4mmf-1fJsqD2YUX-00yxO4 for ; Sat, 02 Dec 2017 07:09:12 +0100 Subject: Ping: Unicode update of width and other character properties References: To: newlib@sourceware.org From: Thomas Wolff X-Forwarded-Message-Id: Message-ID: Date: Sat, 02 Dec 2017 11:25:00 -0000 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.5.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit X-UI-Out-Filterresults: notjunk:1;V01:K0:SbwHt49oMYk=:GNQvvEiDiO4fiQ2gUTfq0b 9veyK/+KDZXUfOmKV3Q4efzC1ble4d+VYJrWYWuOzO76RWp+gUSqDRhN3vnBpZXBBObqJpJFa NgZwOyPHySWuV/i4YZlLPHstO28eB4zjtodWDD6rxdJ3uLfPEoGuskAbKJ+2SRzSipH2FIkJB Pg4ObYZyJ+TN3sxIuS/mRYZiPFUCzJmeV58GwdI5kx+4aZnBgdylCmdvHLUj2HZkp+2+Ez9un Sg0huOOZfB+bFGGdoFDHh4VmSABTpqlAujLSxDsclFpJ9OrRcctiJRPZZVNT+D9NYjEFWs/Vy qaVynyLtSTDzCxNZbTCo94a/lWEMKl2HTyAKtGLt2aN96caumVyZc0/IxgMxwev0ogdcFYIZn rTVjvGgbQNe5toAN8mYOy1YaWKQgauZVOySWdg9e5EjI/8BUbxNJW0erysgR8KWyWTeHJd3RX mDXIduDQA+wNP393o2fnDtUdJCrLIsnIiYp14ILC5U9btTwqiSME3VFUpY73CdpJPWJGb93xE HWc4UtMktDEajg2HC4GRySx+x4ZzNbrrJiy9ReOy4d2j/2KGKvP4YwPDK2mNp5b10iDKwGc/H rL+xiAr4Nio3Hs0lTuD6x7CmWqrD2An/qIKfWcc4lMtqtL1WU8nMU8MfrCikS6E+s+vO91qSm 1Xa3sH/45G0Yib2yz5aEHlwEosIM9LWs7HGbj1nuklaclMkz1zikZHpvP+SzCrgArFb+Ainb2 hA32GY3QtZmkUj8PtD2En20QlgyWyCgUoHLVXU5LkTxLFyjQN+/9UpkjQ7hIL7BSCS9kTGdPY apu+FqN X-IsSubscribed: yes X-SW-Source: 2017/txt/msg01218.txt.bz2 Hi, this is to remind of may patch for wcwidth Unicode consistence, as requested. Thomas -------- Weitergeleitete Nachricht -------- Betreff: Unicode update of width and other character properties Datum: Sun, 6 Aug 2017 07:36:10 +0200 Von: Thomas Wolff An: newlib@sourceware.org Hi, this is a proposal to update wcwidth and the character properties functions isw*/towupper/towlower to Unicode 10.0, as discussed in the mail thread https://cygwin.com/ml/cygwin/2017-07/msg00366.html, as well as to simplify automatic generation of respective tables for an easier update step. Table size is moderate (using ranges for character properties) but there is still an option to reduce the two big tables in size. The patch can be retrieved from http://towo.net/cygwin/charprops10.zip . The Makefile.widthdata does not yet distinguish the two subdirectories (libc/string, libc/ctypw) as it comes from a common development directory. There is a test program in which comparison for isw*/tow* functions between current and patched implementation can be compared. I also provide a log of deviations of the new approach to the current implementation, based on Unicode 5.2 data, to compare and check. If there are any disputable cases, I would consider that of course. My main aim was actually to get the wcwidth data updated, for which the change is more obviously clear. Thanks Thomas