public inbox for gcc-bugs@sourceware.org
help / color / mirror / Atom feed
* [Bug libstdc++/58628] New: Incorrect std::isalpha results with UTF-8 locale on illumos
@ 2013-10-05  8:39 alp at rsu dot ru
  2022-02-24 18:28 ` [Bug libstdc++/58628] " egallager at gcc dot gnu.org
                   ` (2 more replies)
  0 siblings, 3 replies; 4+ messages in thread
From: alp at rsu dot ru @ 2013-10-05  8:39 UTC (permalink / raw)
  To: gcc-bugs

[-- Warning: decoded text below may be mangled, UTF-8 assumed --]
[-- Attachment #1: Type: text/plain; charset="UTF-8", Size: 3163 bytes --]

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=58628

            Bug ID: 58628
           Summary: Incorrect std::isalpha results with UTF-8 locale on
                    illumos
           Product: gcc
           Version: 4.4.7
            Status: UNCONFIRMED
          Severity: normal
          Priority: P3
         Component: libstdc++
          Assignee: unassigned at gcc dot gnu.org
          Reporter: alp at rsu dot ru

The following example when compiled with gcc 3.4,4.7 or 4.4  and launched with
en_US.UTF-8 locale on illumos says that char with 196 code is alphabetic,
however it's not a correct UTF-8 symbol.


$ env LANG=en_US.UTF-8 ./test_ctype
"�"
letter is �
(int)letter is 196
std::use_facet:true

If this program is compiled with Sun Studio Compiler (CC) results are:
$ env LANG=en_US.UTF-8 ./test_ctype_CC
"�"
letter is �
(int)letter is 196
std::use_facet:false

if I compile this program on Linux / FreeBSD , results are correct.

Related OpenIndiana bug report: 
    https://www.illumos.org/issues/4173
Discussion on illumos-dev:
    http://comments.gmane.org/gmane.os.illumos.devel/14193
>From gcc-bugs-return-431125-listarch-gcc-bugs=gcc.gnu.org@gcc.gnu.org Sat Oct 05 08:40:44 2013
Return-Path: <gcc-bugs-return-431125-listarch-gcc-bugs=gcc.gnu.org@gcc.gnu.org>
Delivered-To: listarch-gcc-bugs@gcc.gnu.org
Received: (qmail 9011 invoked by alias); 5 Oct 2013 08:40:44 -0000
Mailing-List: contact gcc-bugs-help@gcc.gnu.org; run by ezmlm
Precedence: bulk
List-Id: <gcc-bugs.gcc.gnu.org>
List-Archive: <http://gcc.gnu.org/ml/gcc-bugs/>
List-Post: <mailto:gcc-bugs@gcc.gnu.org>
List-Help: <mailto:gcc-bugs-help@gcc.gnu.org>
Sender: gcc-bugs-owner@gcc.gnu.org
Delivered-To: mailing list gcc-bugs@gcc.gnu.org
Received: (qmail 8990 invoked by uid 48); 5 Oct 2013 08:40:41 -0000
From: "alp at rsu dot ru" <gcc-bugzilla@gcc.gnu.org>
To: gcc-bugs@gcc.gnu.org
Subject: [Bug libstdc++/58628] Incorrect std::isalpha results with UTF-8 locale on illumos
Date: Sat, 05 Oct 2013 08:40:00 -0000
X-Bugzilla-Reason: CC
X-Bugzilla-Type: changed
X-Bugzilla-Watch-Reason: None
X-Bugzilla-Product: gcc
X-Bugzilla-Component: libstdc++
X-Bugzilla-Version: 4.4.7
X-Bugzilla-Keywords:
X-Bugzilla-Severity: normal
X-Bugzilla-Who: alp at rsu dot ru
X-Bugzilla-Status: UNCONFIRMED
X-Bugzilla-Priority: P3
X-Bugzilla-Assigned-To: unassigned at gcc dot gnu.org
X-Bugzilla-Target-Milestone: ---
X-Bugzilla-Flags:
X-Bugzilla-Changed-Fields: attachments.created
Message-ID: <bug-58628-4-APc8AtHb3x@http.gcc.gnu.org/bugzilla/>
In-Reply-To: <bug-58628-4@http.gcc.gnu.org/bugzilla/>
References: <bug-58628-4@http.gcc.gnu.org/bugzilla/>
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: 7bit
X-Bugzilla-URL: http://gcc.gnu.org/bugzilla/
Auto-Submitted: auto-generated
MIME-Version: 1.0
X-SW-Source: 2013-10/txt/msg00269.txt.bz2
Content-length: 220

http://gcc.gnu.org/bugzilla/show_bug.cgi?idX628

--- Comment #1 from Alexander Pyhalov <alp at rsu dot ru> ---
Created attachment 30958
  --> http://gcc.gnu.org/bugzilla/attachment.cgi?id0958&actioníit
ctype test


^ permalink raw reply	[flat|nested] 4+ messages in thread

* [Bug libstdc++/58628] Incorrect std::isalpha results with UTF-8 locale on illumos
  2013-10-05  8:39 [Bug libstdc++/58628] New: Incorrect std::isalpha results with UTF-8 locale on illumos alp at rsu dot ru
@ 2022-02-24 18:28 ` egallager at gcc dot gnu.org
  2022-09-16 18:11 ` alp at rsu dot ru
  2022-09-16 18:16 ` alp at rsu dot ru
  2 siblings, 0 replies; 4+ messages in thread
From: egallager at gcc dot gnu.org @ 2022-02-24 18:28 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=58628

Eric Gallager <egallager at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
   Last reconfirmed|                            |2022-02-24
             Status|UNCONFIRMED                 |WAITING
                 CC|                            |egallager at gcc dot gnu.org
     Ever confirmed|0                           |1

--- Comment #2 from Eric Gallager <egallager at gcc dot gnu.org> ---
3.4, 4.4, and 4.7 are pretty old at this point; does this still happen with
newer versions of GCC?

^ permalink raw reply	[flat|nested] 4+ messages in thread

* [Bug libstdc++/58628] Incorrect std::isalpha results with UTF-8 locale on illumos
  2013-10-05  8:39 [Bug libstdc++/58628] New: Incorrect std::isalpha results with UTF-8 locale on illumos alp at rsu dot ru
  2022-02-24 18:28 ` [Bug libstdc++/58628] " egallager at gcc dot gnu.org
@ 2022-09-16 18:11 ` alp at rsu dot ru
  2022-09-16 18:16 ` alp at rsu dot ru
  2 siblings, 0 replies; 4+ messages in thread
From: alp at rsu dot ru @ 2022-09-16 18:11 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=58628

--- Comment #3 from Alexander Pyhalov <alp at rsu dot ru> ---
I still see this behavior with gcc version 10.4.0.

^ permalink raw reply	[flat|nested] 4+ messages in thread

* [Bug libstdc++/58628] Incorrect std::isalpha results with UTF-8 locale on illumos
  2013-10-05  8:39 [Bug libstdc++/58628] New: Incorrect std::isalpha results with UTF-8 locale on illumos alp at rsu dot ru
  2022-02-24 18:28 ` [Bug libstdc++/58628] " egallager at gcc dot gnu.org
  2022-09-16 18:11 ` alp at rsu dot ru
@ 2022-09-16 18:16 ` alp at rsu dot ru
  2 siblings, 0 replies; 4+ messages in thread
From: alp at rsu dot ru @ 2022-09-16 18:16 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=58628

--- Comment #4 from Alexander Pyhalov <alp at rsu dot ru> ---
If it helps, the last comment from illumos-gate bug report says

"From what I can tell ctype<wchar_t>::_M_initialize_ctype() in
gcc-5.1.0/ibstdc++-v3/config/locale/generic/ctype_members.cc:248 is basically
just calling btowc(i) for all i <= 0 <= 255 and storing the result. If
std::locale::classic() is called before the setlocale() call in the test
program, things happen to work, but apparently the initialization uses whatever
the current locale is."

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2022-09-16 18:16 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2013-10-05  8:39 [Bug libstdc++/58628] New: Incorrect std::isalpha results with UTF-8 locale on illumos alp at rsu dot ru
2022-02-24 18:28 ` [Bug libstdc++/58628] " egallager at gcc dot gnu.org
2022-09-16 18:11 ` alp at rsu dot ru
2022-09-16 18:16 ` alp at rsu dot ru

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).