public inbox for gcc-bugs@sourceware.org help / color / mirror / Atom feed
* [Bug c++/49952] New: Unicode literals do not generate errors as prescribed by the FDIS standard @ 2011-08-02 22:09 z0sh at sogetthis dot com 2011-08-02 22:32 ` [Bug c++/49952] [C++0x] " paolo.carlini at oracle dot com ` (4 more replies) 0 siblings, 5 replies; 6+ messages in thread From: z0sh at sogetthis dot com @ 2011-08-02 22:09 UTC (permalink / raw) To: gcc-bugs http://gcc.gnu.org/bugzilla/show_bug.cgi?id=49952 Summary: Unicode literals do not generate errors as prescribed by the FDIS standard Product: gcc Version: 4.6.1 Status: UNCONFIRMED Severity: trivial Priority: P3 Component: c++ AssignedTo: unassigned@gcc.gnu.org ReportedBy: z0sh@sogetthis.com Host: Linux x86 Referring to the standard: http://www.open-std.org/jtc1/sc22/wg21/docs/papers/2007/n2442.htm The character literal \UNNNNNNNN must only accept characters in the range 0-0x10FFFF, excluding surrogates. However, GCC allows 31-bit values above 0x10FFFF. To wit, the following compiles: char32_t s[] = U"\U0010FFFF\U7FFFFFFF"; It may be that the actual wording of the FDIS (2.3.2, p.19) is more relaxed than in the reference I gave above and that this behaviour is in fact intentional, but I thought I bring it up anyway. ^ permalink raw reply [flat|nested] 6+ messages in thread
* [Bug c++/49952] [C++0x] Unicode literals do not generate errors as prescribed by the FDIS standard 2011-08-02 22:09 [Bug c++/49952] New: Unicode literals do not generate errors as prescribed by the FDIS standard z0sh at sogetthis dot com @ 2011-08-02 22:32 ` paolo.carlini at oracle dot com 2011-08-03 11:19 ` joseph at codesourcery dot com ` (3 subsequent siblings) 4 siblings, 0 replies; 6+ messages in thread From: paolo.carlini at oracle dot com @ 2011-08-02 22:32 UTC (permalink / raw) To: gcc-bugs http://gcc.gnu.org/bugzilla/show_bug.cgi?id=49952 Paolo Carlini <paolo.carlini at oracle dot com> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |kris.van.hees at oracle dot | |com --- Comment #1 from Paolo Carlini <paolo.carlini at oracle dot com> 2011-08-02 22:32:10 UTC --- Kris, are you willing to triage this PR? ^ permalink raw reply [flat|nested] 6+ messages in thread
* [Bug c++/49952] [C++0x] Unicode literals do not generate errors as prescribed by the FDIS standard 2011-08-02 22:09 [Bug c++/49952] New: Unicode literals do not generate errors as prescribed by the FDIS standard z0sh at sogetthis dot com 2011-08-02 22:32 ` [Bug c++/49952] [C++0x] " paolo.carlini at oracle dot com @ 2011-08-03 11:19 ` joseph at codesourcery dot com 2011-08-03 11:37 ` z0sh at sogetthis dot com ` (2 subsequent siblings) 4 siblings, 0 replies; 6+ messages in thread From: joseph at codesourcery dot com @ 2011-08-03 11:19 UTC (permalink / raw) To: gcc-bugs http://gcc.gnu.org/bugzilla/show_bug.cgi?id=49952 --- Comment #2 from joseph at codesourcery dot com <joseph at codesourcery dot com> 2011-08-03 11:19:01 UTC --- C and C++ reference ISO 10646 instead of Unicode, meaning that it is natural and proper for the full ISO 10646 range of values to be accepted instead of the restricted Unicode range. N3291 does appear to have this restriction on char32_t string (but not character) literals; C1X does not. ^ permalink raw reply [flat|nested] 6+ messages in thread
* [Bug c++/49952] [C++0x] Unicode literals do not generate errors as prescribed by the FDIS standard 2011-08-02 22:09 [Bug c++/49952] New: Unicode literals do not generate errors as prescribed by the FDIS standard z0sh at sogetthis dot com 2011-08-02 22:32 ` [Bug c++/49952] [C++0x] " paolo.carlini at oracle dot com 2011-08-03 11:19 ` joseph at codesourcery dot com @ 2011-08-03 11:37 ` z0sh at sogetthis dot com 2011-08-03 12:01 ` paolo.carlini at oracle dot com 2021-12-02 2:29 ` pinskia at gcc dot gnu.org 4 siblings, 0 replies; 6+ messages in thread From: z0sh at sogetthis dot com @ 2011-08-03 11:37 UTC (permalink / raw) To: gcc-bugs http://gcc.gnu.org/bugzilla/show_bug.cgi?id=49952 --- Comment #3 from Kerrek SB <z0sh at sogetthis dot com> 2011-08-03 11:36:41 UTC --- Maybe it could trigger a warning in -pedantic mode? ^ permalink raw reply [flat|nested] 6+ messages in thread
* [Bug c++/49952] [C++0x] Unicode literals do not generate errors as prescribed by the FDIS standard 2011-08-02 22:09 [Bug c++/49952] New: Unicode literals do not generate errors as prescribed by the FDIS standard z0sh at sogetthis dot com ` (2 preceding siblings ...) 2011-08-03 11:37 ` z0sh at sogetthis dot com @ 2011-08-03 12:01 ` paolo.carlini at oracle dot com 2021-12-02 2:29 ` pinskia at gcc dot gnu.org 4 siblings, 0 replies; 6+ messages in thread From: paolo.carlini at oracle dot com @ 2011-08-03 12:01 UTC (permalink / raw) To: gcc-bugs http://gcc.gnu.org/bugzilla/show_bug.cgi?id=49952 --- Comment #4 from Paolo Carlini <paolo.carlini at oracle dot com> 2011-08-03 12:00:30 UTC --- Adding a warning would be easy. ^ permalink raw reply [flat|nested] 6+ messages in thread
* [Bug c++/49952] [C++0x] Unicode literals do not generate errors as prescribed by the FDIS standard 2011-08-02 22:09 [Bug c++/49952] New: Unicode literals do not generate errors as prescribed by the FDIS standard z0sh at sogetthis dot com ` (3 preceding siblings ...) 2011-08-03 12:01 ` paolo.carlini at oracle dot com @ 2021-12-02 2:29 ` pinskia at gcc dot gnu.org 4 siblings, 0 replies; 6+ messages in thread From: pinskia at gcc dot gnu.org @ 2021-12-02 2:29 UTC (permalink / raw) To: gcc-bugs https://gcc.gnu.org/bugzilla/show_bug.cgi?id=49952 Andrew Pinski <pinskia at gcc dot gnu.org> changed: What |Removed |Added ---------------------------------------------------------------------------- Last reconfirmed| |2021-12-02 Keywords| |diagnostic Ever confirmed|0 |1 Status|UNCONFIRMED |NEW --- Comment #5 from Andrew Pinski <pinskia at gcc dot gnu.org> --- With -std=c++20, we do warn: <source>:2:21: warning: \U7FFFFFFF is outside the UCS codespace 2 | char32_t s[] = U"\U0010FFFF\U7FFFFFFF"; | ^~~~~~~~~~~~~~~~~~~~~~~ This warning was implemented in r10-3414-g0900e29cdbc5. I wonder if we should just enable it for all C++ standards and above of C++20+. ^ permalink raw reply [flat|nested] 6+ messages in thread
end of thread, other threads:[~2021-12-02 2:29 UTC | newest] Thread overview: 6+ messages (download: mbox.gz / follow: Atom feed) -- links below jump to the message on this page -- 2011-08-02 22:09 [Bug c++/49952] New: Unicode literals do not generate errors as prescribed by the FDIS standard z0sh at sogetthis dot com 2011-08-02 22:32 ` [Bug c++/49952] [C++0x] " paolo.carlini at oracle dot com 2011-08-03 11:19 ` joseph at codesourcery dot com 2011-08-03 11:37 ` z0sh at sogetthis dot com 2011-08-03 12:01 ` paolo.carlini at oracle dot com 2021-12-02 2:29 ` pinskia at gcc dot gnu.org
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).