From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: by sourceware.org (Postfix, from userid 48) id 190C1385843A; Wed, 21 Dec 2022 09:29:24 +0000 (GMT) DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org 190C1385843A DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gcc.gnu.org; s=default; t=1671614964; bh=wVpgqV/+vDNkVHVJlfRz/ueX5C8KwaciiaMKJMKTdq0=; h=From:To:Subject:Date:In-Reply-To:References:From; b=KFdViMVpIpU3B48w/G7P4bCuMxSekq18Jm12OpkS6+mZXZpW8tMfeW2dG7/4qMcE/ PUgzYgNJ8vhG2vgdMM30QbLLEN0YOyEFnK1/IgK6jyBNC7DrwIWX8MXNJ2MDrnsYK5 25dC0n31e+cKptDpoSeMwmRxxq++6T5uJmPnk0qc= From: "luoyonggang at gmail dot com" To: gcc-bugs@gcc.gnu.org Subject: [Bug target/108191] Add support to usage of *intrin.h without -mavx512f -mavx512cd Date: Wed, 21 Dec 2022 09:29:23 +0000 X-Bugzilla-Reason: CC X-Bugzilla-Type: changed X-Bugzilla-Watch-Reason: None X-Bugzilla-Product: gcc X-Bugzilla-Component: target X-Bugzilla-Version: 13.0 X-Bugzilla-Keywords: X-Bugzilla-Severity: enhancement X-Bugzilla-Who: luoyonggang at gmail dot com X-Bugzilla-Status: RESOLVED X-Bugzilla-Resolution: WONTFIX X-Bugzilla-Priority: P3 X-Bugzilla-Assigned-To: unassigned at gcc dot gnu.org X-Bugzilla-Target-Milestone: --- X-Bugzilla-Flags: X-Bugzilla-Changed-Fields: Message-ID: In-Reply-To: References: Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://gcc.gnu.org/bugzilla/ Auto-Submitted: auto-generated MIME-Version: 1.0 List-Id: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=3D108191 --- Comment #6 from =E7=BD=97=E5=8B=87=E5=88=9A(Yonggang Luo) --- Is the following command are valid usage? It's compiled properly ``` // compile args: -fPIC -O2 -D__SSE3__=3D1 -D__SSSE3__=3D1 -D__SSE4_1__=3D1 -D__SSE4_2__=3D1 -D__SSE4A__=3D1 -D__POPCNT__=3D1 -D__XSAVE__=3D1 -D__CRC32= __=3D1 -D__AVX__=3D1 -D__AVX2__=3D1 -D__FP_FAST_FMAF32=3D1 -D__FP_FAST_FMAF64=3D1 -D__FP_FAST_FMAF=3D1 -D__FP_FAST_FMAF32x=3D1 -D__AVX512F__=3D1 -D__AVX512CD= __=3D1 #include #pragma GCC push_options #pragma GCC target("avx512f") #pragma GCC target("avx512cd") #pragma GCC target("sse4a") #if defined(_MSC_VER) #include #else #include #endif #pragma GCC pop_options #pragma GCC push_options #pragma GCC target("avx512f") #pragma GCC target("avx512cd") #pragma GCC target("sse4a") void util_fadd_512(float *a, float *b, float *c) { /* a =3D b + c */ __m512 av =3D _mm512_load_ps(a); __m512 bv =3D _mm512_load_ps(b); __m512 cv =3D _mm512_add_ps(av, bv); _mm512_store_ps(c, cv); } static inline int util_iround(float f) { __m128 m =3D _mm_set_ss(f); return _mm_cvtss_i32(m); } #pragma GCC pop_options int util_iround_outside(int x, float y) { return x + util_iround(y); } float util_fadd(float a, float b) { return a + b; } ```=