From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [216.205.24.124]) by sourceware.org (Postfix) with ESMTP id 4DFD23955406 for ; Mon, 19 Jul 2021 15:33:14 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.1 sourceware.org 4DFD23955406 Received: from mail-qv1-f69.google.com (mail-qv1-f69.google.com [209.85.219.69]) (Using TLS) by relay.mimecast.com with ESMTP id us-mta-551-FlwLue7NNWORpem0rPgHjw-1; Mon, 19 Jul 2021 11:33:11 -0400 X-MC-Unique: FlwLue7NNWORpem0rPgHjw-1 Received: by mail-qv1-f69.google.com with SMTP id c22-20020a0ca9d60000b02902e60d75210eso15493539qvb.19 for ; Mon, 19 Jul 2021 08:33:11 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:subject:to:cc:references:from:organization :message-id:date:user-agent:mime-version:in-reply-to :content-language:content-transfer-encoding; bh=flXix4v5SgF/jylTJbkPlLoN7TKt06QLlIi39tEiR2Q=; b=nsmqTYwrMWS1GG3q9mz+DMSjAmAwtUgmoEm9EL3pg0Nn0GLndK7ukeI9CeczVEsblL 8pXnM1cH4R3HI7qsXGoROEkIER/EX860tUPXR71R1rUnwND5qZSlipig+JawrNDacz1N eBZtismYCmIr29etQWZYHhePLoiOd6wljTFnQ3pgzvWp0bsv7LMLIg1j2D398BgmF/gd e4SjPHg43jsqcDYc6cjNDlbLYFrWtzHWgXW4aLHbV8Ou0y+xQItcVL0Yw136bpnVuMO7 pi+J/sqvo5sbUDDLckCf+GtqT5+dg8r24foMnHG1ucn0lGnnWE+aUCOXzY2jcyB+om/y xE7w== X-Gm-Message-State: AOAM531IsOi4lBScw1ih61fkXHDZZv4IWwzEGO/wsvOVm2BbEIlqzmPk o+/JhpKH4ZRWnbHwWGrDfFSueoZ7VV/KWJkYRBgIrhypKn6PguN2hhskZrbJvU8OacYKT/p+bpp E1wayzVaQHDaYSxJ9oxo86l2lSOLOGrV0fV7c5lKPWciY4OlZWeFjbum2QTNGvQvy1PbBoA== X-Received: by 2002:ac8:5fcb:: with SMTP id k11mr22403760qta.102.1626708790758; Mon, 19 Jul 2021 08:33:10 -0700 (PDT) X-Google-Smtp-Source: ABdhPJxMwB8NqRfL/rQYouY3PsvskFXEGbeXI1bz2jdyb644lJLzKuFvU8Gwe9EmPNwUzROw9f1jYw== X-Received: by 2002:ac8:5fcb:: with SMTP id k11mr22403744qta.102.1626708790497; Mon, 19 Jul 2021 08:33:10 -0700 (PDT) Received: from [192.168.1.16] (198-84-214-74.cpe.teksavvy.com. [198.84.214.74]) by smtp.gmail.com with ESMTPSA id i21sm6691039qti.45.2021.07.19.08.33.09 (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Mon, 19 Jul 2021 08:33:09 -0700 (PDT) Subject: Re: C.UTF-8 review To: Florian Weimer , Paul Eggert Cc: libc-alpha@sourceware.org References: <87o8b5ds5q.fsf@oldenburg.str.redhat.com> <5edf60ff-1c3e-7086-b78f-5707c20e33d2@cs.ucla.edu> <878s287y6o.fsf@oldenburg.str.redhat.com> From: Carlos O'Donell Organization: Red Hat Message-ID: <3a2dab05-ee68-2593-cf9b-707eb1552ec5@redhat.com> Date: Mon, 19 Jul 2021 11:33:08 -0400 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:78.0) Gecko/20100101 Thunderbird/78.11.0 MIME-Version: 1.0 In-Reply-To: <878s287y6o.fsf@oldenburg.str.redhat.com> X-Mimecast-Spam-Score: 0 X-Mimecast-Originator: redhat.com Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 8bit X-Spam-Status: No, score=-6.6 required=5.0 tests=BAYES_00, DKIMWL_WL_HIGH, DKIM_SIGNED, DKIM_VALID, DKIM_VALID_AU, DKIM_VALID_EF, NICE_REPLY_A, RCVD_IN_DNSWL_LOW, RCVD_IN_MSPIKE_H4, RCVD_IN_MSPIKE_WL, SPF_HELO_NONE, SPF_NONE, TXREP autolearn=ham autolearn_force=no version=3.4.4 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on server2.sourceware.org X-BeenThere: libc-alpha@sourceware.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Libc-alpha mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 19 Jul 2021 15:33:15 -0000 On 7/15/21 4:56 AM, Florian Weimer wrote: > * Paul Eggert: > >> Dumb question. Will C.UTF-8 have the same worst-case strcoll >> performance that en_US.UTF-8 does? I'm asking because I wonder whether >> we can recommend C.UTF-8 as a workaround for the strcoll performance >> bug, in cases where plain C is not appropriate. >> >> https://sourceware.org/bugzilla/show_bug.cgi?id=18441 >> >> https://debbugs.gnu.org/cgi/bugreport.cgi?bug=49340 > > I'm not sure if that advice is correct … Could you expand on this a bit more please? > With the new C.UTF-8 implementation, strcoll automatically switches to > strcmp, so it will be as fast as it can be. Agreed. -- Cheers, Carlos.