From mboxrd@z Thu Jan 1 00:00:00 1970 From: Tom Tromey To: nobody@gcc.gnu.org Cc: gcc-prs@gcc.gnu.org Subject: Re: java/2319: invalid UTF-8 sequences should be rejected Date: Mon, 19 Mar 2001 09:16:00 -0000 Message-id: <20010319171601.28057.qmail@sourceware.cygnus.com> X-SW-Source: 2001-03/msg00133.html List-Id: The following reply was made to PR java/2319; it has been noted by GNATS. From: Tom Tromey To: "Joseph S. Myers" Cc: , Subject: Re: java/2319: invalid UTF-8 sequences should be rejected Date: 19 Mar 2001 10:19:18 -0700 >>>>> "Joseph" == Joseph S Myers writes: Joseph> Also note that the invalid sequences that should be rejected Joseph> include over-long sequences and UTF-8 encodings that would map Joseph> to values in the UTF-16 surrogate range. I agree, with the sole exception that I think we should accept the Java form of \0. Java represents this as a two-byte sequence and it seems reasonable that a Java compiler would accept this form. Tom