From: "ezyang at mit dot edu"
To: glibc-bugs@sources.redhat.com
Subject: [Bug libc/13518] New: iconv truncates input with //IGNORE
Date: Sun, 18 Dec 2011 22:34:00 -0000

http://sourceware.org/bugzilla/show_bug.cgi?id=13518

             Bug #: 13518
           Summary: iconv truncates input with //IGNORE
           Product: glibc
           Version: 2.14
            Status: NEW
          Severity: normal
          Priority: P2
         Component: libc
        AssignedTo: drepper.fsp@gmail.com
        ReportedBy: ezyang@mit.edu
    Classification: Unclassified

iconv seems to truncate inputs at around 8157 bytes if they contain invalid
characters for the target set, even if IGNORE is specified.

Steps to reproduce:

1. Download iconv.html

ezyang@javelin:~$ wget http://www.oppcharts.com/iconv.html

2. Attempt to convert UTF-8 to iso-8859-1//IGNORE

Expected behavior (from libiconv-1.14):

ezyang@javelin:~/Dev/glibc/build$ ~/Desktop/libiconv-1.14/src/iconv_no_i18n -f utf-8 -t iso-8859-1//IGNORE ~/iconv.html | wc -c
15312

Actual behavior (from latest Git glibc-2.14-567-ga4647e7):

ezyang@javelin:~/Dev/glibc/build$ ./testrun.sh iconv/iconv_prog -f utf-8 -t iso-8859-1//IGNORE ~/iconv.html | wc -c
iconv/iconv_prog: illegal input sequence at position 8168
8157
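
To check the expected byte count without relying on the downloaded file, the intended //IGNORE semantics can be modeled roughly as below. This is only a sketch: it uses Python's built-in codecs rather than glibc's iconv, and the helper names (make_sample, expected_ignore_output), the choice of U+20AC as the unconvertible character, and the sizes are all assumptions made for illustration. The input is deliberately longer than the roughly 8 KiB point where the truncation was observed.

```python
# Sketch only: models the *expected* //IGNORE result with Python's codecs,
# not with glibc's iconv. Names and sizes here are illustrative assumptions.


def make_sample(prefix_len: int = 9000, suffix_len: int = 1000) -> bytes:
    """Build UTF-8 input containing one character (U+20AC) that ISO-8859-1 lacks."""
    text = "a" * prefix_len + "\u20ac" + "b" * suffix_len
    return text.encode("utf-8")


def expected_ignore_output(utf8_bytes: bytes) -> bytes:
    """Drop every character ISO-8859-1 cannot represent and keep all the rest."""
    return utf8_bytes.decode("utf-8").encode("iso-8859-1", errors="ignore")


if __name__ == "__main__":
    sample = make_sample()
    converted = expected_ignore_output(sample)
    # Input and output sizes in bytes; the text after the dropped
    # character must still be present in the output.
    print(len(sample), len(converted))
```

With the default sizes the sample is 10003 bytes and the modeled output is 10000 bytes. Writing the sample to a file and piping it through the iconv command from step 2 into wc -c should give the same count if the whole input is converted; a smaller count would suggest the same truncation.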