public inbox for glibc-bugs@sourceware.org
help / color / mirror / Atom feed
* [Bug libc/4335] New: EastAsianAmbiguous character width is always 1 in UTF-8 locale
@ 2007-04-08 12:18 d+bugzilla at vdr dot jp
  2007-06-02 23:43 ` [Bug libc/4335] " bruno at clisp dot org
                   ` (4 more replies)
  0 siblings, 5 replies; 7+ messages in thread
From: d+bugzilla at vdr dot jp @ 2007-04-08 12:18 UTC (permalink / raw)
  To: glibc-bugs

According to /usr/share/i18n/charmaps/UTF-8.gz,
Character width is 1 by default.  W(Wide) and F(Full Width) are 2.

% Character width according to Unicode 3.2.
% - Default width is 1.
% - Double-width characters have width 2; generated from
%        "grep '^[^;]*;[WF]' EastAsianWidth.txt"
%   and  "grep '^[^;]*;[^WF]' EastAsianWidth.txt"
% - Non-spacing characters have width 0; generated from PropList.txt or
%   "grep '^[^;]*;[^;]*;[^;]*;[^;]*;NSM;' UnicodeData.txt"
% - Format control characters have width 0; generated from
%   "grep '^[^;]*;[^;]*;Cf;' UnicodeData.txt"
% - Zero width characters have width 0; generated from
%   "grep '^[^;]*;ZERO WIDTH ' UnicodeData.txt"

A(Ambiguous) is expected that it is context-sensitive,
but its width is always 1 irrelevant to context.

According to http://www.unicode.org/reports/tr11/#Recommendations

> When mapping Unicode to East Asian legacy character encodings
> 
>     * Wide Unicode characters always map to fullwidth characters.
>     * Narrow (and neutral) Unicode characters always map to halfwidth characters.
>     * Halfwidth Unicode characters always map to halfwidth characters.
>     * Ambiguous Unicode characters always map to fullwidth characters.

I think EastAsianAmbiguous character width should be 2 in CJK UTF-8 locale.

-- 
           Summary: EastAsianAmbiguous character width is always 1 in UTF-8
                    locale
           Product: glibc
           Version: unspecified
            Status: NEW
          Severity: normal
          Priority: P2
         Component: libc
        AssignedTo: drepper at redhat dot com
        ReportedBy: d+bugzilla at vdr dot jp
                CC: glibc-bugs at sources dot redhat dot com


http://sourceware.org/bugzilla/show_bug.cgi?id=4335

------- You are receiving this mail because: -------
You are on the CC list for the bug, or are watching someone who is.


^ permalink raw reply	[flat|nested] 7+ messages in thread
[parent not found: <bug-4335-131@http.sourceware.org/bugzilla/>]

end of thread, other threads:[~2012-12-19 10:51 UTC | newest]

Thread overview: 7+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2007-04-08 12:18 [Bug libc/4335] New: EastAsianAmbiguous character width is always 1 in UTF-8 locale d+bugzilla at vdr dot jp
2007-06-02 23:43 ` [Bug libc/4335] " bruno at clisp dot org
2007-06-10 13:05 ` d+bugzilla at vdr dot jp
2007-11-27 16:04 ` d+bugzilla at vdr dot jp
2008-11-25 17:28 ` d+bugzilla at vdr dot jp
2009-02-28  7:38 ` d+bugzilla at vdr dot jp
     [not found] <bug-4335-131@http.sourceware.org/bugzilla/>
2012-12-19 10:51 ` schwab@linux-m68k.org

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).