From: "bugdal at aerifal dot cx"
To: glibc-bugs@sources.redhat.com
Subject: [Bug libc/13757] mbstowcs(3) unable to handle 8bit characters.
Date: Sat, 03 Mar 2012 13:42:00 -0000

http://sourceware.org/bugzilla/show_bug.cgi?id=13757

Rich Felker changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |bugdal at aerifal dot cx

--- Comment #7 from Rich Felker 2012-03-03 13:41:43 UTC ---
The charmap for the C locale should definitely not be ISO-8859-anything. All
that does is encourage broken, non-portable program behavior. If you are going
to use mbrtowc and family and intend to process characters not in the portable
character set, you MUST call setlocale for the LC_CTYPE category.

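To illustrate the setlocale requirement, here is a minimal sketch; it is not code from the bug report. The helper names use_environment_ctype and count_chars are hypothetical, and the sketch assumes the caller wants the LC_CTYPE settings taken from the environment.

```c
#include <locale.h>
#include <string.h>
#include <wchar.h>

/* Hypothetical helper: adopt the LC_CTYPE category from the environment.
   Returns 1 on success, 0 if the requested locale is not available. */
int use_environment_ctype(void)
{
    return setlocale(LC_CTYPE, "") != NULL;
}

/* Hypothetical helper: count the characters in a multibyte string under
   the current LC_CTYPE locale. Returns (size_t)-1 if the string contains
   a sequence that is invalid or incomplete in that locale. */
size_t count_chars(const char *s)
{
    mbstate_t st;
    size_t len = strlen(s);
    size_t count = 0;

    memset(&st, 0, sizeof st);          /* start in the initial shift state */
    while (len > 0) {
        /* A null first argument means "decode but do not store". */
        size_t n = mbrtowc(NULL, s, len, &st);
        if (n == (size_t)-1 || n == (size_t)-2)
            return (size_t)-1;          /* invalid or truncated sequence */
        if (n == 0)
            break;                      /* null character; not expected here */
        s += n;
        len -= n;
        count++;
    }
    return count;
}
```

A program would typically call use_environment_ctype() once at startup and only then pass non-ASCII input to count_chars(). What a byte above 0x7f yields without that call depends on the libc in use, so the sketch makes no claim about it.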
The system calls you referred to (e.g. readdir and readlink) do not use any
character map. They process bytes. In any case, if you wanted the C locale to
match the filesystem's encoding, it would have to be UTF-8, not ISO-8859-1, at
least on any modern system, and I'm pretty sure that's not what you want since
you seem to be advocating for very backwards behavior...
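As a rough illustration of "they process bytes", the sketch below dumps each directory entry name as hex without touching the locale at all. The helpers name_to_hex and print_dir_bytes are hypothetical names introduced here, not part of any library.

```c
#include <dirent.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical helper: write the bytes of name into out as lowercase hex,
   two digits per byte. Returns 0 on success, -1 if out is too small. */
int name_to_hex(const char *name, char *out, size_t outsz)
{
    size_t len = strlen(name);

    if (outsz < 2 * len + 1)
        return -1;
    for (size_t i = 0; i < len; i++)
        snprintf(out + 2 * i, 3, "%02x", (unsigned char)name[i]);
    out[2 * len] = '\0';
    return 0;
}

/* Hypothetical helper: print every entry name in path as raw bytes.
   No setlocale call and no character conversion is involved.
   Returns 0 on success, -1 if the directory cannot be opened. */
int print_dir_bytes(const char *path)
{
    DIR *d = opendir(path);
    struct dirent *e;

    if (d == NULL)
        return -1;
    while ((e = readdir(d)) != NULL) {
        size_t need = 2 * strlen(e->d_name) + 1;
        char *hex = malloc(need);

        if (hex != NULL && name_to_hex(e->d_name, hex, need) == 0)
            printf("%s\n", hex);
        free(hex);
    }
    closedir(d);
    return 0;
}
```

Calling print_dir_bytes(".") prints one hex string per entry. A name stored as UTF-8 shows up as its UTF-8 bytes regardless of which locale the process has selected.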