From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 17952 invoked by alias); 8 Apr 2007 12:18:10 -0000 Received: (qmail 17934 invoked by uid 48); 8 Apr 2007 12:18:00 -0000 Date: Sun, 08 Apr 2007 12:18:00 -0000 From: "d+bugzilla at vdr dot jp" To: glibc-bugs@sources.redhat.com Message-ID: <20070408131800.4335.d+bugzilla@vdr.jp> Reply-To: sourceware-bugzilla@sourceware.org Subject: [Bug libc/4335] New: EastAsianAmbiguous character width is always 1 in UTF-8 locale X-Bugzilla-Reason: CC Mailing-List: contact glibc-bugs-help@sourceware.org; run by ezmlm Precedence: bulk List-Id: List-Subscribe: List-Post: List-Help: , Sender: glibc-bugs-owner@sourceware.org X-SW-Source: 2007-04/txt/msg00021.txt.bz2 According to /usr/share/i18n/charmaps/UTF-8.gz, Character width is 1 by default. W(Wide) and F(Full Width) are 2. % Character width according to Unicode 3.2. % - Default width is 1. % - Double-width characters have width 2; generated from % "grep '^[^;]*;[WF]' EastAsianWidth.txt" % and "grep '^[^;]*;[^WF]' EastAsianWidth.txt" % - Non-spacing characters have width 0; generated from PropList.txt or % "grep '^[^;]*;[^;]*;[^;]*;[^;]*;NSM;' UnicodeData.txt" % - Format control characters have width 0; generated from % "grep '^[^;]*;[^;]*;Cf;' UnicodeData.txt" % - Zero width characters have width 0; generated from % "grep '^[^;]*;ZERO WIDTH ' UnicodeData.txt" A(Ambiguous) is expected that it is context-sensitive, but its width is always 1 irrelevant to context. According to http://www.unicode.org/reports/tr11/#Recommendations > When mapping Unicode to East Asian legacy character encodings > > * Wide Unicode characters always map to fullwidth characters. > * Narrow (and neutral) Unicode characters always map to halfwidth characters. > * Halfwidth Unicode characters always map to halfwidth characters. > * Ambiguous Unicode characters always map to fullwidth characters. I think EastAsianAmbiguous character width should be 2 in CJK UTF-8 locale. -- Summary: EastAsianAmbiguous character width is always 1 in UTF-8 locale Product: glibc Version: unspecified Status: NEW Severity: normal Priority: P2 Component: libc AssignedTo: drepper at redhat dot com ReportedBy: d+bugzilla at vdr dot jp CC: glibc-bugs at sources dot redhat dot com http://sourceware.org/bugzilla/show_bug.cgi?id=4335 ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.