From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mail-il1-x141.google.com (mail-il1-x141.google.com [IPv6:2607:f8b0:4864:20::141]) by sourceware.org (Postfix) with ESMTPS id DFAB13894C00 for ; Tue, 1 Sep 2020 19:23:59 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.3.2 sourceware.org DFAB13894C00 Received: by mail-il1-x141.google.com with SMTP id h11so2390507ilj.11 for ; Tue, 01 Sep 2020 12:23:59 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=RYPAXTzyAhZV+3xQ1lyHcRt+8l0fVzeheDvKtQiBGfk=; b=Q+4yc+VAdftEN5aMRFvvOquxBNSZ/v2LQM+8FGnIbY9EwAsZCT/66X7t2u539OEyZK dWZ0UWCDdh9NBo0C8SkpBItTpXp/XV96bLn2Og7f5KSNybPOUSezuTJaN9SKJBvzpaJR f4rDsA7yEgAMF/BVSQBARTbZi9RwvvhYaLPYd/mA54WU8x0S2HBSMtJlvVFj/3PdDQtZ eyWt/BixrvE8rRwG5ibvTx71RtRK2D0dvHicVx/zZjX+tjpMj0HyuPlm0BQUfJhYHmWD EP5PfTmhvkuCBxr/pUdsHxHHkJKoJzVYGbprgnnoeEjIVwBiF58VXaD401k0F82XXIXC UWPA== X-Gm-Message-State: AOAM533m9R7X+tbyy+P/m7zwFmZn1n317WAhtJuD+Uqsm71YvmQZ0CAP +o7Fn7tYd+bf3l9H4Uo4VMtj7qh4IoHunMRdooIs2tAqGdA= X-Google-Smtp-Source: ABdhPJzXsZcKVbHfmyPkTBOujAUl4xg+ljcjeXmOeKwFkl4huEC5L5wGj1WBnKgQnsPxbLff3RU6aHbH6yWF2Jw8kSE= X-Received: by 2002:a92:6a09:: with SMTP id f9mr500870ilc.273.1598988239243; Tue, 01 Sep 2020 12:23:59 -0700 (PDT) MIME-Version: 1.0 References: <20200819104539.9854-1-sajan.karumanchi@amd.com> <20200819104539.9854-2-sajan.karumanchi@amd.com> In-Reply-To: <20200819104539.9854-2-sajan.karumanchi@amd.com> From: "H.J. Lu" Date: Tue, 1 Sep 2020 12:23:23 -0700 Message-ID: Subject: Re: [PATCH 1/1] x86: Tuning NT Threshold parameter for AMD machines. To: Sajan Karumanchi Cc: GNU C Library , "Carlos O'Donell" , Sajan Karumanchi , "Mallappa, Premachandra" Content-Type: text/plain; charset="UTF-8" X-Spam-Status: No, score=-3.1 required=5.0 tests=BAYES_00, DKIM_SIGNED, DKIM_VALID, DKIM_VALID_AU, DKIM_VALID_EF, FREEMAIL_FROM, RCVD_IN_DNSWL_NONE, SPF_HELO_NONE, SPF_PASS, TXREP autolearn=ham autolearn_force=no version=3.4.2 X-Spam-Checker-Version: SpamAssassin 3.4.2 (2018-09-13) on server2.sourceware.org X-BeenThere: libc-alpha@sourceware.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Libc-alpha mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 01 Sep 2020 19:24:01 -0000 On Wed, Aug 19, 2020 at 3:58 AM Sajan Karumanchi via Libc-alpha wrote: > > Tuning NT threshold parameter to bring in performance gains of > memcpy/memove on AMD cpu's. > > Based on Large and Walk bench variant results, > setting __x86_shared_non_temporal_threshold to 2/3 of shared cache size > brings in performance gains for memcpy/memmove on AMD machines. > The patch looks mostly OK. But I have quite a few x86 patches queued which touch the same codes. Please take a look at https://gitlab.com/x86-glibc/glibc/-/commits/users/hjl/tunable/master and put your patch on top of mine. -- H.J.