From: Florian Weimer
To: Paul Eggert
Cc: Carlos O'Donell, libc-alpha@sourceware.org
Subject: Re: C.UTF-8 review
Date: Thu, 15 Jul 2021 10:56:15 +0200

* Paul Eggert:

> Dumb question. Will C.UTF-8 have the same worst-case strcoll
> performance that en_US.UTF-8 does? I'm asking because I wonder whether
> we can recommend C.UTF-8 as a workaround for the strcoll performance
> bug, in cases where plain C is not appropriate.
>
> https://sourceware.org/bugzilla/show_bug.cgi?id=18441
>
> https://debbugs.gnu.org/cgi/bugreport.cgi?bug=49340

I'm not sure if that advice is correct …

With the new C.UTF-8 implementation, strcoll automatically switches to
strcmp, so it will be as fast as it can be.

Thanks,
Florian