From: "H.J. Lu"
To: libc-alpha@sourceware.org
Subject: [PATCH 13/19] : Add AMX-FP16 support
Date: Wed, 5 Apr 2023 09:21:38 -0700
Message-Id: <20230405162144.984598-14-hjl.tools@gmail.com>
In-Reply-To: <20230405162144.984598-1-hjl.tools@gmail.com>
References: <20230405162144.984598-1-hjl.tools@gmail.com>

Add AMX-FP16 support to .
---
 manual/platform.texi               | 3 +++
 sysdeps/x86/bits/platform/x86.h    | 1 +
 sysdeps/x86/cpu-features.c         | 2 ++
 sysdeps/x86/include/cpu-features.h | 3 +++
 sysdeps/x86/tst-get-cpu-features.c | 2 ++
 5 files changed, 11 insertions(+)

diff --git a/manual/platform.texi b/manual/platform.texi
index af79f5eb4d..7d4aa3d339 100644
--- a/manual/platform.texi
+++ b/manual/platform.texi
@@ -200,6 +200,9 @@ The supported processor features are:
 @item
 @code{AMX_INT8} -- Tile computational operations on 8-bit numbers.
 
+@item
+@code{AMX_FP16} -- Tile computational operations on FP16 numbers.
+
 @item
 @code{AMX_TILE} -- Tile architecture.
 
diff --git a/sysdeps/x86/bits/platform/x86.h b/sysdeps/x86/bits/platform/x86.h
index 2a15ad937a..2776c69b16 100644
--- a/sysdeps/x86/bits/platform/x86.h
+++ b/sysdeps/x86/bits/platform/x86.h
@@ -298,6 +298,7 @@ enum
   x86_cpu_FSRS = x86_cpu_index_7_ecx_1_eax + 11,
   x86_cpu_FSRCS = x86_cpu_index_7_ecx_1_eax + 12,
   x86_cpu_WRMSRNS = x86_cpu_index_7_ecx_1_eax + 19,
+  x86_cpu_AMX_FP16 = x86_cpu_index_7_ecx_1_eax + 21,
   x86_cpu_HRESET = x86_cpu_index_7_ecx_1_eax + 22,
   x86_cpu_LAM = x86_cpu_index_7_ecx_1_eax + 26,
 
diff --git a/sysdeps/x86/cpu-features.c b/sysdeps/x86/cpu-features.c
index da04ad0b00..6c1b5efc5f 100644
--- a/sysdeps/x86/cpu-features.c
+++ b/sysdeps/x86/cpu-features.c
@@ -213,6 +213,8 @@ update_active (struct cpu_features *cpu_features)
           CPU_FEATURE_SET_ACTIVE (cpu_features, AMX_TILE);
           /* Determine if AMX_INT8 is usable. */
           CPU_FEATURE_SET_ACTIVE (cpu_features, AMX_INT8);
+          /* Determine if AMX_FP16 is usable. */
+          CPU_FEATURE_SET_ACTIVE (cpu_features, AMX_FP16);
         }
 
       /* These features are usable only when OSXSAVE is enabled. */
diff --git a/sysdeps/x86/include/cpu-features.h b/sysdeps/x86/include/cpu-features.h
index 4e40fe0482..07c841c1d4 100644
--- a/sysdeps/x86/include/cpu-features.h
+++ b/sysdeps/x86/include/cpu-features.h
@@ -309,6 +309,7 @@ enum
 #define bit_cpu_FZLRM (1u << 10)
 #define bit_cpu_FSRS (1u << 11)
 #define bit_cpu_FSRCS (1u << 12)
+#define bit_cpu_AMX_FP16 (1u << 21)
 #define bit_cpu_HRESET (1u << 22)
 #define bit_cpu_LAM (1u << 26)
 
@@ -546,6 +547,7 @@ enum
 #define index_cpu_FZLRM CPUID_INDEX_7_ECX_1
 #define index_cpu_FSRS CPUID_INDEX_7_ECX_1
 #define index_cpu_FSRCS CPUID_INDEX_7_ECX_1
+#define index_cpu_AMX_FP16 CPUID_INDEX_7_ECX_1
 #define index_cpu_HRESET CPUID_INDEX_7_ECX_1
 #define index_cpu_LAM CPUID_INDEX_7_ECX_1
 
@@ -783,6 +785,7 @@ enum
 #define reg_FZLRM eax
 #define reg_FSRS eax
 #define reg_FSRCS eax
+#define reg_AMX_FP16 eax
 #define reg_HRESET eax
 #define reg_LAM eax
 
diff --git a/sysdeps/x86/tst-get-cpu-features.c b/sysdeps/x86/tst-get-cpu-features.c
index 9c436eaa64..c0f222cb77 100644
--- a/sysdeps/x86/tst-get-cpu-features.c
+++ b/sysdeps/x86/tst-get-cpu-features.c
@@ -210,6 +210,7 @@ do_test (void)
   CHECK_CPU_FEATURE_PRESENT (FSRS);
   CHECK_CPU_FEATURE_PRESENT (FSRCS);
   CHECK_CPU_FEATURE_PRESENT (WRMSRNS);
+  CHECK_CPU_FEATURE_PRESENT (AMX_FP16);
   CHECK_CPU_FEATURE_PRESENT (HRESET);
   CHECK_CPU_FEATURE_PRESENT (LAM);
   CHECK_CPU_FEATURE_PRESENT (AESKLE);
@@ -377,6 +378,7 @@ do_test (void)
   CHECK_CPU_FEATURE_ACTIVE (FZLRM);
   CHECK_CPU_FEATURE_ACTIVE (FSRS);
   CHECK_CPU_FEATURE_ACTIVE (FSRCS);
+  CHECK_CPU_FEATURE_ACTIVE (AMX_FP16);
   CHECK_CPU_FEATURE_ACTIVE (AESKLE);
   CHECK_CPU_FEATURE_ACTIVE (WIDE_KL);
   CHECK_CPU_FEATURE_ACTIVE (PTWRITE);
-- 
2.39.2
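
Note for readers of this patch: the definitions above place AMX_FP16 at bit 21 of EAX for CPUID leaf 7, subleaf 1 (CPUID_INDEX_7_ECX_1, reg eax, 1u << 21). The following is only a minimal sketch of that bit position, not part of the patch and not glibc code. It assumes a GCC or Clang toolchain that ships <cpuid.h>; the names AMX_FP16_EAX_BIT, amx_fp16_bit_from_eax and cpu_enumerates_amx_fp16 are hypothetical and chosen for illustration.

```c
#if defined (__x86_64__) || defined (__i386__)
# include <cpuid.h>   /* GCC/Clang wrapper around the CPUID instruction.  */
#endif

/* Bit position taken from the patch: bit_cpu_AMX_FP16 is (1u << 21)
   in EAX of CPUID leaf 7, subleaf 1.  The macro name is hypothetical.  */
#define AMX_FP16_EAX_BIT 21u

/* Hypothetical helper: extract the AMX-FP16 bit from a raw EAX value.
   Returns 1 if the bit is set, 0 otherwise.  */
int
amx_fp16_bit_from_eax (unsigned int eax)
{
  return (int) ((eax >> AMX_FP16_EAX_BIT) & 1u);
}

/* Hypothetical helper: return 1 if the CPU enumerates AMX-FP16 in
   CPUID, 0 otherwise (including on non-x86 targets).  This checks
   enumeration only; it does not check whether the OS has enabled the
   AMX tile state.  */
int
cpu_enumerates_amx_fp16 (void)
{
#if defined (__x86_64__) || defined (__i386__)
  unsigned int eax = 0, ebx = 0, ecx = 0, edx = 0;

  /* __get_cpuid_count returns 0 if leaf 7 is above the maximum
     supported leaf.  */
  if (!__get_cpuid_count (7, 1, &eax, &ebx, &ecx, &edx))
    return 0;
  return amx_fp16_bit_from_eax (eax);
#else
  return 0;
#endif
}
```

A raw CPUID query like this corresponds at most to the "present" check. The "active" state set in update_active also depends on operating-system support for the AMX state, so code that intends to execute AMX-FP16 instructions would presumably want the glibc interface (CPU_FEATURE_ACTIVE (AMX_FP16) from <sys/platform/x86.h>, once a glibc with this patch is installed) rather than this sketch.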