From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 95951 invoked by alias); 29 Oct 2015 03:59:00 -0000 Mailing-List: contact cygwin-help@cygwin.com; run by ezmlm Precedence: bulk List-Id: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: cygwin-owner@cygwin.com Mail-Followup-To: cygwin@cygwin.com Received: (qmail 73809 invoked by uid 89); 29 Oct 2015 03:58:46 -0000 Authentication-Results: sourceware.org; auth=none X-Virus-Found: No X-Spam-SWARE-Status: No, score=-1.8 required=5.0 tests=AWL,BAYES_00,RP_MATCHES_RCVD,SPF_HELO_PASS autolearn=ham version=3.3.2 X-HELO: mx1.redhat.com Received: from mx1.redhat.com (HELO mx1.redhat.com) (209.132.183.28) by sourceware.org (qpsmtpd/0.93/v0.84-503-g423c35a) with (AES256-GCM-SHA384 encrypted) ESMTPS; Thu, 29 Oct 2015 03:58:45 +0000 Received: from int-mx11.intmail.prod.int.phx2.redhat.com (int-mx11.intmail.prod.int.phx2.redhat.com [10.5.11.24]) by mx1.redhat.com (Postfix) with ESMTPS id E5061341ADA for ; Thu, 29 Oct 2015 03:58:43 +0000 (UTC) Received: from [10.3.113.189] (ovpn-113-189.phx2.redhat.com [10.3.113.189]) by int-mx11.intmail.prod.int.phx2.redhat.com (8.14.4/8.14.4) with ESMTP id t9T3whbE014107 for ; Wed, 28 Oct 2015 23:58:43 -0400 Subject: Re: Bug in collation functions? To: cygwin@cygwin.com References: <563148AF.1000502@cornell.edu> From: Eric Blake Openpgp: url=http://people.redhat.com/eblake/eblake.gpg Message-ID: <5631996D.7040908@redhat.com> Date: Thu, 29 Oct 2015 07:50:00 -0000 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:38.0) Gecko/20100101 Thunderbird/38.3.0 MIME-Version: 1.0 In-Reply-To: <563148AF.1000502@cornell.edu> Content-Type: multipart/signed; micalg=pgp-sha256; protocol="application/pgp-signature"; boundary="QFNUOsHSj3ln9AcWVt6W29pOMmeJiuilg" X-IsSubscribed: yes X-SW-Source: 2015-10/txt/msg00524.txt.bz2 --QFNUOsHSj3ln9AcWVt6W29pOMmeJiuilg Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Content-length: 852 On 10/28/2015 04:14 PM, Ken Brown wrote: > It's my understanding that collation is supposed to take whitespace and > punctuation into account in the POSIX locale but not in other locales. Not quite right. It is up to the locale definition whether whitespace affects collation. But you are correct that in the POSIX locale, whitespace must not be ignored in collation. > This doesn't seem to be the case on Cygwin. Here's a test case using > wcscoll, but the same problem occurs with strcoll. That's because the locale definitions are different in cygwin than they are in glibc. But it is not a bug in Cygwin; POSIX allows for different systems to have different locale definitions while still using the same locale name like en_US.UTF-8. --=20 Eric Blake eblake redhat com +1-919-301-3266 Libvirt virtualization library http://libvirt.org --QFNUOsHSj3ln9AcWVt6W29pOMmeJiuilg Content-Type: application/pgp-signature; name="signature.asc" Content-Description: OpenPGP digital signature Content-Disposition: attachment; filename="signature.asc" Content-length: 604 -----BEGIN PGP SIGNATURE----- Version: GnuPG v2 Comment: Public key at http://people.redhat.com/eblake/eblake.gpg Comment: Using GnuPG with Thunderbird - http://www.enigmail.net/ iQEcBAEBCAAGBQJWMZltAAoJEKeha0olJ0NqWFwIAKp1Uc8jEQxPsQPyJO/Icxhs r5uh9DPX57/D5WkIprie6Bh2h0oztNj5+c8q5O8umD/mBbO26lsEHlviwOiq2jCy kKua240Ns0wBEMcDOArginOJv4hK0JejkscaBRfcNFJRK2mgX3LJ3GJHxd/wlg/V PCWZ1FBls9haTDAjyyaOWcguvIZlpVfXaP8sWpE9gZGJ/gxA6qC4yD9WCcWF5m8q BCfmLF1kEg76cdSclaefJ80pJRmXmtLQVCRkQvTCYV/a3tIdpJM+AYjPtT2ZJMqQ P5SIUTE8AQdTjgGEsyhwFThpZEJNXtRQn06Mp0YfbLUrbLoYCSGmS9Ew52VT2J8= =UQ6z -----END PGP SIGNATURE----- --QFNUOsHSj3ln9AcWVt6W29pOMmeJiuilg--