From mboxrd@z Thu Jan 1 00:00:00 1970 From: tromey@redhat.com To: gcc-gnats@gcc.gnu.org Subject: java/2319: invalid UTF-8 sequences should be rejected Date: Mon, 19 Mar 2001 08:36:00 -0000 Message-id: <20010319163215.1814.qmail@sourceware.cygnus.com> X-SW-Source: 2001-03/msg00130.html List-Id: >Number: 2319 >Category: java >Synopsis: invalid UTF-8 sequences should be rejected >Confidential: no >Severity: serious >Priority: medium >Responsible: unassigned >State: open >Class: sw-bug >Submitter-Id: net >Arrival-Date: Mon Mar 19 08:36:00 PST 2001 >Closed-Date: >Last-Modified: >Originator: Tom Tromey >Release: unknown-1.0 >Organization: >Environment: >Description: Currently the compiler accepts invalid UTF-8 sequences when reading a file. Instead we ought to diagnose such sequences as errors. Try compiling this Latin-1 encoded program with --encoding=UTF-8 to see the problem: public class Hello { public static void main ( String []arguments) { System.out.println ("Liberté, égalité, fraternité !"); } } >How-To-Repeat: >Fix: >Release-Note: >Audit-Trail: >Unformatted: