public inbox for gcc-bugs@sourceware.org help / color / mirror / Atom feed
From: "joseph at codesourcery dot com" <gcc-bugzilla@gcc.gnu.org> To: gcc-bugs@gcc.gnu.org Subject: [Bug preprocessor/9449] UCNs not recognized in identifiers (c++/c99) Date: Fri, 07 Jan 2005 10:28:00 -0000 [thread overview] Message-ID: <20050107102755.26974.qmail@sourceware.org> (raw) In-Reply-To: <20030127145600.9449.rearnsha@arm.com> [-- Warning: decoded text below may be mangled, UTF-8 assumed --] [-- Attachment #1: Type: text/plain, Size: 2640 bytes --] ------- Additional Comments From joseph at codesourcery dot com 2005-01-07 10:27 ------- Subject: Re: UCNs not recognized in identifiers (c++/c99) On Fri, 7 Jan 2005, zack at gcc dot gnu dot org wrote: > An obvious rebuttal to this is that the compiler used in step 4 is broken. As > you say, the C standard references ISO10646 not Unicode and the concept of > normalization does not exist in ISO10646, and this could be taken to imply that > no normalization shall occur. However, there is no unambiguous statement to > that effect in the standard, and there is strong quality-of-implementation I think the relevant text is that treating identifiers as sequences of characters and UCNs denoting single characters. I've had no on-list response yet to the query about this I sent to the WG14 reflector on Tuesday (reflector message 10698), with the HEBREW LETTER SHIN WITH DAGESH AND SHIN DOT examples. > pressure in the opposite direction. Put aside the standard for a moment: are > users going to like a compiler that insists that "Å" (U+00C5) and "Å" (U+212B) > are not the same character? [It happens that on my screen those are ever so > slightly different, but that's just luck - and X11 will only let me type U+00C5; > I resorted to hex-editing to get the other.] The question of appearance is the same as that for U+0041 LATIN CAPITAL LETTER A, U+0391 GREEK CAPITAL LETTER ALPHA, U+0410 CYRILLIC CAPITAL LETTER A. Will users like such a compiler less than one which doesn't allow them to use their native language in identifiers at all? > normalization, as a defensive measure against such external changes. > You could argue that this is just another way for C programmers to shoot > themselves in the foot, but I don't think the myriad ways that already > exist are a reason to add more. (It's WG14 and WG21 that added the new way, not us. And it may be that if they are to become convinced there is any mistake then they must see real world problems arising with real implementations of the existing standards, rather than hypothetical problems. Mistakes were made in C99 of adding features in general without adequate implementation experience; changing them without experience showing what is a genuine problem could be seen as another such mistake to avoid.) I could believe there could be a case for -fextended-identifiers required to enable UCNs in identifiers until there is more experience, with documentation along the lines of that formerly associated with -pedantic "This option is not intended to be useful; ...". -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=9449
next prev parent reply other threads:[~2005-01-07 10:28 UTC|newest] Thread overview: 40+ messages / expand[flat|nested] mbox.gz Atom feed top [not found] <20030127145600.9449.rearnsha@arm.com> 2003-11-05 6:51 ` pinskia at gcc dot gnu dot org 2004-12-16 2:16 ` zack at codesourcery dot com 2004-12-16 2:37 ` gdr at integrable-solutions dot net 2004-12-16 2:54 ` joseph at codesourcery dot com 2004-12-16 3:04 ` zack at codesourcery dot com 2004-12-16 12:33 ` joseph at codesourcery dot com 2004-12-16 14:23 ` joseph at codesourcery dot com 2004-12-16 23:05 ` joseph at codesourcery dot com 2005-01-07 7:10 ` zack at gcc dot gnu dot org 2005-01-07 10:28 ` joseph at codesourcery dot com [this message] 2005-01-07 14:27 ` gdr at integrable-solutions dot net 2005-01-07 15:02 ` joseph at codesourcery dot com 2005-01-07 15:39 ` Gabriel Dos Reis 2005-01-07 15:39 ` gdr at integrable-solutions dot net 2005-01-08 2:20 ` geoffk at gcc dot gnu dot org 2005-01-08 4:11 ` joseph at codesourcery dot com 2005-01-08 4:45 ` gdr at integrable-solutions dot net 2005-01-08 5:32 ` joseph at codesourcery dot com 2005-01-09 3:20 ` gdr at integrable-solutions dot net 2005-02-21 21:34 ` jsm28 at gcc dot gnu dot org 2005-02-21 23:39 ` joseph at codesourcery dot com 2005-02-21 23:50 ` zack at codesourcery dot com 2005-02-21 23:51 ` zack at codesourcery dot com 2005-02-22 0:10 ` zack at codesourcery dot com 2005-02-22 3:14 ` neil at daikokuya dot co dot uk 2005-02-22 10:50 ` joseph at codesourcery dot com 2005-02-22 11:24 ` joseph at codesourcery dot com 2005-02-22 12:00 ` joseph at codesourcery dot com 2005-03-12 11:15 ` jsm28 at gcc dot gnu dot org 2005-07-05 2:14 ` pinskia at gcc dot gnu dot org 2005-09-15 22:34 ` geoffk at gcc dot gnu dot org 2005-09-15 22:53 ` Neil Booth 2005-09-15 22:54 ` joseph at codesourcery dot com 2005-09-15 22:54 ` neil at daikokuya dot co dot uk 2005-09-15 22:59 ` neil at daikokuya dot co dot uk 2005-09-15 23:38 ` joseph at codesourcery dot com 2005-09-16 0:02 ` geoffk at geoffk dot org [not found] <bug-9449-4@http.gcc.gnu.org/bugzilla/> 2014-11-05 16:20 ` jsm28 at gcc dot gnu.org 2014-11-05 16:23 ` jsm28 at gcc dot gnu.org 2014-11-06 21:09 ` jsm28 at gcc dot gnu.org
Reply instructions: You may reply publicly to this message via plain-text email using any one of the following methods: * Save the following mbox file, import it into your mail client, and reply-to-all from there: mbox Avoid top-posting and favor interleaved quoting: https://en.wikipedia.org/wiki/Posting_style#Interleaved_style * Reply using the --to, --cc, and --in-reply-to switches of git-send-email(1): git send-email \ --in-reply-to=20050107102755.26974.qmail@sourceware.org \ --to=gcc-bugzilla@gcc.gnu.org \ --cc=gcc-bugs@gcc.gnu.org \ /path/to/YOUR_REPLY https://kernel.org/pub/software/scm/git/docs/git-send-email.html * If your mail client supports setting the In-Reply-To header via mailto: links, try the mailto: linkBe sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).