public inbox for gcc-bugs@sourceware.org
help / color / mirror / Atom feed
* [Bug java/11755] New: non-ASCII characters treated as errors even in comments
@ 2003-08-01 0:36 bero at arklinux dot org
2003-08-01 0:43 ` [Bug java/11755] " pinskia at physics dot uc dot edu
` (4 more replies)
0 siblings, 5 replies; 6+ messages in thread
From: bero at arklinux dot org @ 2003-08-01 0:36 UTC (permalink / raw)
To: gcc-bugs
[-- Warning: decoded text below may be mangled, UTF-8 assumed --]
[-- Attachment #1: Type: text/plain, Size: 892 bytes --]
PLEASE REPLY TO gcc-bugzilla@gcc.gnu.org ONLY, *NOT* gcc-bugs@gcc.gnu.org.
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=11755
Summary: non-ASCII characters treated as errors even in comments
Product: gcc
Version: 3.4
Status: UNCONFIRMED
Severity: normal
Priority: P2
Component: java
AssignedTo: unassigned at gcc dot gnu dot org
ReportedBy: bero at arklinux dot org
CC: gcc-bugs at gcc dot gnu dot org
gcj treats non-ASCII characters (e.g. German umlauts) as errors even if they're
used in comments.
Verified in 3.3 branch and HEAD.
e.g. OpenOffice uses umlauts in comments in Java code; therefore I assume the
Sun JDK handles them as expected.
Test case:
// Test äöüß
class test {
public static void main(String[] args) {
System.out.printtln("Test");
}
}
^ permalink raw reply [flat|nested] 6+ messages in thread
* [Bug java/11755] non-ASCII characters treated as errors even in comments
2003-08-01 0:36 [Bug java/11755] New: non-ASCII characters treated as errors even in comments bero at arklinux dot org
@ 2003-08-01 0:43 ` pinskia at physics dot uc dot edu
2003-08-01 0:47 ` [Bug java/11755] non-ASCII characters treated as errors even in comments if LANG=C bero at arklinux dot org
` (3 subsequent siblings)
4 siblings, 0 replies; 6+ messages in thread
From: pinskia at physics dot uc dot edu @ 2003-08-01 0:43 UTC (permalink / raw)
To: gcc-bugs
PLEASE REPLY TO gcc-bugzilla@gcc.gnu.org ONLY, *NOT* gcc-bugs@gcc.gnu.org.
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=11755
pinskia at physics dot uc dot edu changed:
What |Removed |Added
----------------------------------------------------------------------------
Status|UNCONFIRMED |WAITING
------- Additional Comments From pinskia at physics dot uc dot edu 2003-08-01 00:43 -------
Can you provide the output of "gcj -v"?
In 3.3.1 (20030707) and mainline (20030730) on linux, I cannot reproduce this.
^ permalink raw reply [flat|nested] 6+ messages in thread
* [Bug java/11755] non-ASCII characters treated as errors even in comments if LANG=C
2003-08-01 0:36 [Bug java/11755] New: non-ASCII characters treated as errors even in comments bero at arklinux dot org
2003-08-01 0:43 ` [Bug java/11755] " pinskia at physics dot uc dot edu
@ 2003-08-01 0:47 ` bero at arklinux dot org
2003-08-01 0:55 ` pinskia at physics dot uc dot edu
` (2 subsequent siblings)
4 siblings, 0 replies; 6+ messages in thread
From: bero at arklinux dot org @ 2003-08-01 0:47 UTC (permalink / raw)
To: gcc-bugs
PLEASE REPLY TO gcc-bugzilla@gcc.gnu.org ONLY, *NOT* gcc-bugs@gcc.gnu.org.
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=11755
bero at arklinux dot org changed:
What |Removed |Added
----------------------------------------------------------------------------
Status|WAITING |NEW
Ever Confirmed| |1
Summary|non-ASCII characters treated|non-ASCII characters treated
|as errors even in comments |as errors even in comments
| |if LANG=C
------- Additional Comments From bero at arklinux dot org 2003-08-01 00:47 -------
$ gcj -v |tail -n1
gcc version 3.3.1 20030728 (Ark Linux 1.0 3.3.1-0.cvs20030728.1ark)
$ gcj-34 -v |tail -n1
gcc version 3.4 20030729 (Ark Linux 1.0 3.4-0.cvs20030729.1ark)
A quick investigation shows it happens only if LANG is set to C.
^ permalink raw reply [flat|nested] 6+ messages in thread
* [Bug java/11755] non-ASCII characters treated as errors even in comments if LANG=C
2003-08-01 0:36 [Bug java/11755] New: non-ASCII characters treated as errors even in comments bero at arklinux dot org
2003-08-01 0:43 ` [Bug java/11755] " pinskia at physics dot uc dot edu
2003-08-01 0:47 ` [Bug java/11755] non-ASCII characters treated as errors even in comments if LANG=C bero at arklinux dot org
@ 2003-08-01 0:55 ` pinskia at physics dot uc dot edu
2003-08-01 1:06 ` tromey at gcc dot gnu dot org
2003-08-03 20:28 ` pinskia at physics dot uc dot edu
4 siblings, 0 replies; 6+ messages in thread
From: pinskia at physics dot uc dot edu @ 2003-08-01 0:55 UTC (permalink / raw)
To: gcc-bugs
PLEASE REPLY TO gcc-bugzilla@gcc.gnu.org ONLY, *NOT* gcc-bugs@gcc.gnu.org.
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=11755
pinskia at physics dot uc dot edu changed:
What |Removed |Added
----------------------------------------------------------------------------
Status|NEW |WAITING
------- Additional Comments From pinskia at physics dot uc dot edu 2003-08-01 00:55 -------
I cannot even reproduce it even if LANG=C with an unmodified compiler.
Also you are using a modified compiler, how modified is it, do you know?
^ permalink raw reply [flat|nested] 6+ messages in thread
* [Bug java/11755] non-ASCII characters treated as errors even in comments if LANG=C
2003-08-01 0:36 [Bug java/11755] New: non-ASCII characters treated as errors even in comments bero at arklinux dot org
` (2 preceding siblings ...)
2003-08-01 0:55 ` pinskia at physics dot uc dot edu
@ 2003-08-01 1:06 ` tromey at gcc dot gnu dot org
2003-08-03 20:28 ` pinskia at physics dot uc dot edu
4 siblings, 0 replies; 6+ messages in thread
From: tromey at gcc dot gnu dot org @ 2003-08-01 1:06 UTC (permalink / raw)
To: gcc-bugs
PLEASE REPLY TO gcc-bugzilla@gcc.gnu.org ONLY, *NOT* gcc-bugs@gcc.gnu.org.
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=11755
------- Additional Comments From tromey at gcc dot gnu dot org 2003-08-01 01:06 -------
gcj uses iconv() on platforms where it is available.
gcj also tries to use the locale's encoding by default.
On Linux, in the C locale, you are using ASCII.
glibc's iconv() is strict in this situation and will
reject non-ASCII characters.
The fix is to use something like "gcj --encoding=ISO-8859-1".
You don't see this with javac since Sun uses their own
encoding converters, and their ASCII converter isn't
strict about bytes whose high bit is set.
^ permalink raw reply [flat|nested] 6+ messages in thread
* [Bug java/11755] non-ASCII characters treated as errors even in comments if LANG=C
2003-08-01 0:36 [Bug java/11755] New: non-ASCII characters treated as errors even in comments bero at arklinux dot org
` (3 preceding siblings ...)
2003-08-01 1:06 ` tromey at gcc dot gnu dot org
@ 2003-08-03 20:28 ` pinskia at physics dot uc dot edu
4 siblings, 0 replies; 6+ messages in thread
From: pinskia at physics dot uc dot edu @ 2003-08-03 20:28 UTC (permalink / raw)
To: gcc-bugs
PLEASE REPLY TO gcc-bugzilla@gcc.gnu.org ONLY, *NOT* gcc-bugs@gcc.gnu.org.
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=11755
pinskia at physics dot uc dot edu changed:
What |Removed |Added
----------------------------------------------------------------------------
Status|WAITING |RESOLVED
Resolution| |INVALID
------- Additional Comments From pinskia at physics dot uc dot edu 2003-08-03 20:28 -------
Not a bug based on Tom's analysis.
iconv is just more strict than Sun's javac.
^ permalink raw reply [flat|nested] 6+ messages in thread
end of thread, other threads:[~2003-08-03 20:28 UTC | newest]
Thread overview: 6+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2003-08-01 0:36 [Bug java/11755] New: non-ASCII characters treated as errors even in comments bero at arklinux dot org
2003-08-01 0:43 ` [Bug java/11755] " pinskia at physics dot uc dot edu
2003-08-01 0:47 ` [Bug java/11755] non-ASCII characters treated as errors even in comments if LANG=C bero at arklinux dot org
2003-08-01 0:55 ` pinskia at physics dot uc dot edu
2003-08-01 1:06 ` tromey at gcc dot gnu dot org
2003-08-03 20:28 ` pinskia at physics dot uc dot edu
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).