From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 17090 invoked by alias); 31 Jan 2003 12:16:02 -0000 Mailing-List: contact gcc-prs-help@gcc.gnu.org; run by ezmlm Precedence: bulk List-Archive: List-Post: List-Help: Sender: gcc-prs-owner@gcc.gnu.org Received: (qmail 16862 invoked by uid 71); 31 Jan 2003 12:16:01 -0000 Resent-Date: 31 Jan 2003 12:16:01 -0000 Resent-Message-ID: <20030131121601.16861.qmail@sources.redhat.com> Resent-From: gcc-gnats@gcc.gnu.org (GNATS Filer) Resent-Cc: gcc-prs@gcc.gnu.org, gcc-bugs@gcc.gnu.org Resent-Reply-To: gcc-gnats@gcc.gnu.org, peturr02@ru.is Received: (qmail 14765 invoked by uid 48); 31 Jan 2003 12:13:33 -0000 Message-Id: <20030131121333.14762.qmail@sources.redhat.com> Date: Fri, 31 Jan 2003 12:16:00 -0000 From: peturr02@ru.is Reply-To: peturr02@ru.is To: gcc-gnats@gcc.gnu.org X-Send-Pr-Version: gnatsweb-2.9.3 (1.1.1.1.2.31) Subject: libstdc++/9520: Garbled input from wcin X-SW-Source: 2003-01/txt/msg01750.txt.bz2 List-Id: >Number: 9520 >Category: libstdc++ >Synopsis: Garbled input from wcin >Confidential: no >Severity: serious >Priority: medium >Responsible: unassigned >State: open >Class: sw-bug >Submitter-Id: net >Arrival-Date: Fri Jan 31 12:16:01 UTC 2003 >Closed-Date: >Last-Modified: >Originator: peturr02@ru.is >Release: gcc-3.2.1 >Organization: >Environment: Red Hat Linux 8.0 >Description: If wcin.rdbuf()->sgetc() is called followed by wcin.rdbuf()->sbumpc(), the return value is not always the same, unless the charset in use happens to be ISO-8859-1. Reason: basic_filebuf::_M_underflow_common returns wide characters to the FILE* using ungetc. This happens to work if the charset is ISO-8859-1 (because the values are the same as in UCS-4), but breaks for all other single-byte charsets. Note that there is an even more serious bug here when dealing with multibyte charsets: in general a single wide character can have been converted from more than one narrow character. This is however shadowed by other problems in _M_underflow_common. >How-To-Repeat: See attachment. >Fix: >Release-Note: >Audit-Trail: >Unformatted: ----gnatsweb-attachment---- Content-Type: text/plain; name="ungetcbug.cc" Content-Disposition: inline; filename="ungetcbug.cc" #include #include #include #include #include #include #include #include int main() { using namespace std; const char* name = "tmp"; filebuf fbuf1; fbuf1.open(name, ios_base::out | ios_base::trunc); for (int i = 1; i < 256; ++i) fbuf1.sputc(static_cast(i)); fbuf1.close(); int fd = open(name, O_RDONLY); assert(fd != -1); dup2(fd, 0); locale loc ("en_US.ISO-8859-15"); locale::global(loc); wcin.imbue(loc); for (int j = 1; j < 256; ++j) { wint_t c1 = wcin.rdbuf()->sgetc(); wint_t c2 = wcin.rdbuf()->sbumpc(); assert(c1 == c2); assert(c1 != WEOF); } return 0; }