From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 30319 invoked by alias); 26 Jun 2004 22:00:55 -0000 Mailing-List: contact java-prs-help@gcc.gnu.org; run by ezmlm Precedence: bulk List-Subscribe: List-Archive: List-Post: List-Help: , Sender: java-prs-owner@gcc.gnu.org Received: (qmail 19367 invoked by uid 48); 26 Jun 2004 21:56:58 -0000 Date: Sat, 26 Jun 2004 22:10:00 -0000 Message-ID: <20040626215658.19366.qmail@sourceware.org> From: "joeclark at iastate dot edu" To: java-prs@gcc.gnu.org In-Reply-To: <20040323044412.14687.joeclark@iastate.edu> References: <20040323044412.14687.joeclark@iastate.edu> Reply-To: gcc-bugzilla@gcc.gnu.org Subject: [Bug java/14687] Incorrect UTF-8 byte->String conversion X-Bugzilla-Reason: CC X-SW-Source: 2004-q2/txt/msg00287.txt.bz2 List-Id: ------- Additional Comments From joeclark at iastate dot edu 2004-06-26 21:56 ------- Okay, I've attached a java file that illustrates the bug. For me, the UTF character set does *NOT* work for *ANY* ASCII characters. Perhaps I'm entering the bytes wrong or something; if this is the case, the attached code should make my error clear. When I run this with Sun's JDK, I get the same value for all 127 values. With gcj, *every* UTF-8 conversion results in a "?" string; here's a quick snippet: b 109 --> utf ? & iso m b 110 --> utf ? & iso n b 111 --> utf ? & iso o -- What |Removed |Added ---------------------------------------------------------------------------- Status|RESOLVED |UNCONFIRMED Resolution|INVALID | http://gcc.gnu.org/bugzilla/show_bug.cgi?id=14687