public inbox for gcc-bugs@sourceware.org
help / color / mirror / Atom feed
* [Bug java/11755] New: non-ASCII characters treated as errors even in comments
@ 2003-08-01  0:36 bero at arklinux dot org
  2003-08-01  0:43 ` [Bug java/11755] " pinskia at physics dot uc dot edu
                   ` (4 more replies)
  0 siblings, 5 replies; 6+ messages in thread
From: bero at arklinux dot org @ 2003-08-01  0:36 UTC (permalink / raw)
  To: gcc-bugs

[-- Warning: decoded text below may be mangled, UTF-8 assumed --]
[-- Attachment #1: Type: text/plain, Size: 892 bytes --]

PLEASE REPLY TO gcc-bugzilla@gcc.gnu.org ONLY, *NOT* gcc-bugs@gcc.gnu.org.

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=11755

           Summary: non-ASCII characters treated as errors even in comments
           Product: gcc
           Version: 3.4
            Status: UNCONFIRMED
          Severity: normal
          Priority: P2
         Component: java
        AssignedTo: unassigned at gcc dot gnu dot org
        ReportedBy: bero at arklinux dot org
                CC: gcc-bugs at gcc dot gnu dot org

gcj treats non-ASCII characters (e.g. German umlauts) as errors even if they're 
used in comments. 
 
Verified in 3.3 branch and HEAD. 
 
e.g. OpenOffice uses umlauts in comments in Java code; therefore I assume the 
Sun JDK handles them as expected. 
 
Test case: 
 
// Test äöüß 
class test { 
	public static void main(String[] args) { 
		System.out.printtln("Test"); 
	} 
}


^ permalink raw reply	[flat|nested] 6+ messages in thread

* [Bug java/11755] non-ASCII characters treated as errors even in comments
  2003-08-01  0:36 [Bug java/11755] New: non-ASCII characters treated as errors even in comments bero at arklinux dot org
@ 2003-08-01  0:43 ` pinskia at physics dot uc dot edu
  2003-08-01  0:47 ` [Bug java/11755] non-ASCII characters treated as errors even in comments if LANG=C bero at arklinux dot org
                   ` (3 subsequent siblings)
  4 siblings, 0 replies; 6+ messages in thread
From: pinskia at physics dot uc dot edu @ 2003-08-01  0:43 UTC (permalink / raw)
  To: gcc-bugs

PLEASE REPLY TO gcc-bugzilla@gcc.gnu.org ONLY, *NOT* gcc-bugs@gcc.gnu.org.

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=11755


pinskia at physics dot uc dot edu changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|UNCONFIRMED                 |WAITING


------- Additional Comments From pinskia at physics dot uc dot edu  2003-08-01 00:43 -------
Can you provide the output of "gcj -v"?
In 3.3.1 (20030707) and mainline (20030730) on linux, I cannot reproduce this.


^ permalink raw reply	[flat|nested] 6+ messages in thread

* [Bug java/11755] non-ASCII characters treated as errors even in comments if LANG=C
  2003-08-01  0:36 [Bug java/11755] New: non-ASCII characters treated as errors even in comments bero at arklinux dot org
  2003-08-01  0:43 ` [Bug java/11755] " pinskia at physics dot uc dot edu
@ 2003-08-01  0:47 ` bero at arklinux dot org
  2003-08-01  0:55 ` pinskia at physics dot uc dot edu
                   ` (2 subsequent siblings)
  4 siblings, 0 replies; 6+ messages in thread
From: bero at arklinux dot org @ 2003-08-01  0:47 UTC (permalink / raw)
  To: gcc-bugs

PLEASE REPLY TO gcc-bugzilla@gcc.gnu.org ONLY, *NOT* gcc-bugs@gcc.gnu.org.

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=11755


bero at arklinux dot org changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|WAITING                     |NEW
     Ever Confirmed|                            |1
            Summary|non-ASCII characters treated|non-ASCII characters treated
                   |as errors even in comments  |as errors even in comments
                   |                            |if LANG=C


------- Additional Comments From bero at arklinux dot org  2003-08-01 00:47 -------
$ gcj -v |tail -n1 
gcc version 3.3.1 20030728 (Ark Linux 1.0 3.3.1-0.cvs20030728.1ark) 
$ gcj-34 -v |tail -n1 
gcc version 3.4 20030729 (Ark Linux 1.0 3.4-0.cvs20030729.1ark) 
 
 
A quick investigation shows it happens only if LANG is set to C.


^ permalink raw reply	[flat|nested] 6+ messages in thread

* [Bug java/11755] non-ASCII characters treated as errors even in comments if LANG=C
  2003-08-01  0:36 [Bug java/11755] New: non-ASCII characters treated as errors even in comments bero at arklinux dot org
  2003-08-01  0:43 ` [Bug java/11755] " pinskia at physics dot uc dot edu
  2003-08-01  0:47 ` [Bug java/11755] non-ASCII characters treated as errors even in comments if LANG=C bero at arklinux dot org
@ 2003-08-01  0:55 ` pinskia at physics dot uc dot edu
  2003-08-01  1:06 ` tromey at gcc dot gnu dot org
  2003-08-03 20:28 ` pinskia at physics dot uc dot edu
  4 siblings, 0 replies; 6+ messages in thread
From: pinskia at physics dot uc dot edu @ 2003-08-01  0:55 UTC (permalink / raw)
  To: gcc-bugs

PLEASE REPLY TO gcc-bugzilla@gcc.gnu.org ONLY, *NOT* gcc-bugs@gcc.gnu.org.

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=11755


pinskia at physics dot uc dot edu changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |WAITING


------- Additional Comments From pinskia at physics dot uc dot edu  2003-08-01 00:55 -------
I cannot even reproduce it even if LANG=C with an unmodified compiler.
Also you are using a modified compiler, how modified is it, do you know?


^ permalink raw reply	[flat|nested] 6+ messages in thread

* [Bug java/11755] non-ASCII characters treated as errors even in comments if LANG=C
  2003-08-01  0:36 [Bug java/11755] New: non-ASCII characters treated as errors even in comments bero at arklinux dot org
                   ` (2 preceding siblings ...)
  2003-08-01  0:55 ` pinskia at physics dot uc dot edu
@ 2003-08-01  1:06 ` tromey at gcc dot gnu dot org
  2003-08-03 20:28 ` pinskia at physics dot uc dot edu
  4 siblings, 0 replies; 6+ messages in thread
From: tromey at gcc dot gnu dot org @ 2003-08-01  1:06 UTC (permalink / raw)
  To: gcc-bugs

PLEASE REPLY TO gcc-bugzilla@gcc.gnu.org ONLY, *NOT* gcc-bugs@gcc.gnu.org.

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=11755



------- Additional Comments From tromey at gcc dot gnu dot org  2003-08-01 01:06 -------
gcj uses iconv() on platforms where it is available.
gcj also tries to use the locale's encoding by default.

On Linux, in the C locale, you are using ASCII.
glibc's iconv() is strict in this situation and will
reject non-ASCII characters.

The fix is to use something like "gcj --encoding=ISO-8859-1".

You don't see this with javac since Sun uses their own
encoding converters, and their ASCII converter isn't
strict about bytes whose high bit is set.


^ permalink raw reply	[flat|nested] 6+ messages in thread

* [Bug java/11755] non-ASCII characters treated as errors even in comments if LANG=C
  2003-08-01  0:36 [Bug java/11755] New: non-ASCII characters treated as errors even in comments bero at arklinux dot org
                   ` (3 preceding siblings ...)
  2003-08-01  1:06 ` tromey at gcc dot gnu dot org
@ 2003-08-03 20:28 ` pinskia at physics dot uc dot edu
  4 siblings, 0 replies; 6+ messages in thread
From: pinskia at physics dot uc dot edu @ 2003-08-03 20:28 UTC (permalink / raw)
  To: gcc-bugs

PLEASE REPLY TO gcc-bugzilla@gcc.gnu.org ONLY, *NOT* gcc-bugs@gcc.gnu.org.

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=11755


pinskia at physics dot uc dot edu changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|WAITING                     |RESOLVED
         Resolution|                            |INVALID


------- Additional Comments From pinskia at physics dot uc dot edu  2003-08-03 20:28 -------
Not a bug based on Tom's analysis.
iconv is just more strict than Sun's javac.


^ permalink raw reply	[flat|nested] 6+ messages in thread

end of thread, other threads:[~2003-08-03 20:28 UTC | newest]

Thread overview: 6+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2003-08-01  0:36 [Bug java/11755] New: non-ASCII characters treated as errors even in comments bero at arklinux dot org
2003-08-01  0:43 ` [Bug java/11755] " pinskia at physics dot uc dot edu
2003-08-01  0:47 ` [Bug java/11755] non-ASCII characters treated as errors even in comments if LANG=C bero at arklinux dot org
2003-08-01  0:55 ` pinskia at physics dot uc dot edu
2003-08-01  1:06 ` tromey at gcc dot gnu dot org
2003-08-03 20:28 ` pinskia at physics dot uc dot edu

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).