From mboxrd@z Thu Jan 1 00:00:00 1970
MIME-Version: 1.0
References: <20220624164216.2129400-1-goldstein.w.n@gmail.com>
In-Reply-To: <20220624164216.2129400-1-goldstein.w.n@gmail.com>
From: "H.J. Lu"
Date: Fri, 24 Jun 2022 10:15:17 -0700
Subject: Re: [PATCH v2] x86: Align entry for memrchr to 64-bytes.
To: Noah Goldstein
Cc: GNU C Library, "Carlos O'Donell"
Content-Type: text/plain; charset="UTF-8"
List-Id: Libc-alpha mailing list

On Fri, Jun 24, 2022 at 9:42 AM Noah Goldstein wrote:
>
> The function was tuned around 64-byte entry alignment and performs
> better for all sizes with it.
>
> Also, the different code paths were explicitly written to touch the
> minimum number of cache lines, i.e. sizes <= 32 touch only the entry
> cache line.
> ---
>  sysdeps/x86_64/multiarch/memrchr-avx2.S | 2 +-
>  1 file changed, 1 insertion(+), 1 deletion(-)
>
> diff --git a/sysdeps/x86_64/multiarch/memrchr-avx2.S b/sysdeps/x86_64/multiarch/memrchr-avx2.S
> index 9c83c76d3c..f300d7daf4 100644
> --- a/sysdeps/x86_64/multiarch/memrchr-avx2.S
> +++ b/sysdeps/x86_64/multiarch/memrchr-avx2.S
> @@ -35,7 +35,7 @@
>  # define VEC_SIZE 32
>  # define PAGE_SIZE 4096
>  .section SECTION(.text), "ax", @progbits
> -ENTRY(MEMRCHR)
> +ENTRY_P2ALIGN(MEMRCHR, 6)
>  # ifdef __ILP32__
>  /* Clear upper bits. */
>  and %RDX_LP, %RDX_LP
> --
> 2.34.1
>

LGTM.

Thanks.

-- 
H.J.
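
Editorial note on the arithmetic behind the patch: the second argument to ENTRY_P2ALIGN is a power-of-two exponent, so 6 requests 2^6 = 64-byte alignment. The C sketch below is only an illustration, not glibc code. It assumes a 64-byte cache line, and the helper names (p2align_bytes, lines_touched) are hypothetical. It shows why a short code path that starts at a 64-byte-aligned entry can stay within one cache line, while the same path starting at an entry that is only 16-byte aligned may straddle two.

```c
#include <assert.h>
#include <stdint.h>

#define CACHE_LINE 64u /* assumed cache-line size in bytes */

/* Byte alignment requested by a power-of-two exponent, e.g. 6 -> 64.
   Hypothetical helper, not part of glibc. */
static uint64_t p2align_bytes(unsigned exp)
{
    return (uint64_t)1 << exp;
}

/* Number of cache lines covered by the byte range [start, start + len).
   Requires len > 0. Hypothetical helper, not part of glibc. */
static uint64_t lines_touched(uint64_t start, uint64_t len)
{
    uint64_t first = start / CACHE_LINE;
    uint64_t last = (start + len - 1) / CACHE_LINE;
    return last - first + 1;
}
```

For a made-up 48-byte code path, lines_touched(0x1000, 48) is 1 because 0x1000 is 64-byte aligned, whereas lines_touched(0x1030, 48) is 2 because the range crosses the boundary at 0x1040. The addresses and the 48-byte length are illustrative values, not measurements of memrchr-avx2.S.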