Re: [PATCH 0/2] Initial support for AVX512FP16

public inbox for gcc-patches@gcc.gnu.org
 help / color / mirror / Atom feed

From: "H.J. Lu" <hjl.tools@gmail.com>
To: Uros Bizjak <ubizjak@gmail.com>
Cc: "gcc-patches@gcc.gnu.org" <gcc-patches@gcc.gnu.org>,
	Hongtao Liu <crazylht@gmail.com>,
	 Jakub Jelinek <jakub@redhat.com>,
	liuhongt <hongtao.liu@intel.com>
Subject: Re: [PATCH 0/2] Initial support for AVX512FP16
Date: Thu, 1 Jul 2021 05:39:59 -0700	[thread overview]
Message-ID: <CAMe9rOrvBeNmC6efzsBLVrW+PggCUGsM5fNNGKs7fiu-Ok994A@mail.gmail.com> (raw)
In-Reply-To: <CAFULd4YgVhLmCWhnnNFP7uuYFdS+ORkn2K12tKg_Eb3ozeaZ5w@mail.gmail.com>

On Thu, Jul 1, 2021 at 4:10 AM Uros Bizjak <ubizjak@gmail.com> wrote:
>
> [Sorry for double post, gcc-patches address was wrong in original post]
>
> On Thu, Jul 1, 2021 at 7:48 AM liuhongt <hongtao.liu@intel.com> wrote:
> >
> > Hi:
> >   AVX512FP16 is disclosed, refer to [1].
> >   There're 100+ instructions for AVX512FP16, 67 gcc patches, for the convenience of review, we divide the 67 patches into 2 major parts.
> >   The first part is 2 patches containing basic support for AVX512FP16 (options, cpuid, _Float16 type, libgcc, etc.), and the second part is 65 patches covering all instructions of AVX512FP16(including intrinsic support and some optimizations).
> >   There is a problem with the first part, _Float16 is not a C++ standard, so the front-end does not support this type and its mangling, so we "make up" a _Float16 type on the back-end and use _DF16 as its mangling. The purpose of this is to align with llvm side, because llvm C++ FE already supports _Float16[2].
> >
> > [1] https://software.intel.com/content/www/us/en/develop/download/intel-avx512-fp16-architecture-specification.html
> > [2] https://reviews.llvm.org/D33719
>
> Looking through implementation of _Float16 support, I think, there is
> no need for _Float16 support to depend on AVX512FP16.
>
> The compiler is smart enough to use either a named pattern that
> describes the instruction when available or diverts to a library call
> to a soft-fp implementation. So, I think that general _Float16 support
> should be implemented first (similar to _float128) and then upgraded
> with AVX512FP16 specific instructions.
>
> MOVW loads/stores to XMM reg can be emulated with MOVD and a SImode
> secondary_reload register.
>
> soft-fp library already includes all the infrastructure to implement
> _Float16 (see half.h), so HFmode basic operations should be trivial to
> implement (I have gone through this exercise personally years ago when
> implementing __float128 soft-fp support).
>
> Looking through the patch 1/2, it looks that a new ABI is introduced,
> where FP16 values are passed through XMM registers, but I don't think
> there is updated psABI documentation available (for x86_64 as well as

_Float16 support was added to x86-64 psABI:

https://gitlab.com/x86-psABIs/x86-64-ABI/-/commit/71d1183e7bb95e9f8ad732e0f2b5a4f127796e2a

2 years ago.

> i386, where FP16 values will probably be passed through memory).

That is correct.

> So, the net effect of the above proposal(s) is that x86 will support
> _Float16 out-of the box, emulate it via soft-fp without AVX512FP16 and
> use AVX512FP16 instructions with -mavx512fp16.
>

The main issue is complex _Float16 functions in libgcc.  If _Float16 doesn't
require -mavx512fp16, we need to compile complex _Float16 functions in
libgcc without -mavx512fp16.  Complex _Float16 performance is very
important for our _Float16 usage.   _Float16 performance has to be
very fast.  There should be no emulation anywhere when -mavx512fp16
is used.   That is why _Float16 is available only with -mavx512fp16.

-- 
H.J.

next prev parent reply	other threads:[~2021-07-01 12:40 UTC|newest]

Thread overview: 138+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <20210701054808.39000-1-hongtao.liu@intel.com>
2021-07-01  5:55 ` Hongtao Liu
2021-07-01 20:46   ` Joseph Myers
2021-07-06  8:53     ` Hongtao Liu
     [not found] ` <20210701054808.39000-3-hongtao.liu@intel.com>
2021-07-01  5:55   ` [PATCH 2/2] AVX512FP16: Add HFmode support in libgcc Hongtao Liu
     [not found] ` <20210701054808.39000-2-hongtao.liu@intel.com>
2021-07-01  5:55   ` [PATCH 1/2] AVX512FP16: Initial support for _Float16 type and AVX512FP16 feature Hongtao Liu
2021-07-01 11:10 ` [PATCH 0/2] Initial support for AVX512FP16 Uros Bizjak
2021-07-01 12:39   ` H.J. Lu [this message]
2021-07-01 12:58     ` Richard Biener
2021-07-01 13:03       ` Jakub Jelinek
2021-07-06  8:51         ` Hongtao Liu
2021-07-06 10:14           ` Richard Biener
2021-07-06 12:11             ` H.J. Lu
2021-07-06 18:20               ` Joseph Myers
2021-07-06 18:18             ` Joseph Myers
2021-07-06 18:11           ` Joseph Myers
2021-07-07  1:24             ` Hongtao Liu
2021-07-14  7:50               ` Hongtao Liu
2021-07-14 15:32                 ` [llvm-dev] " Craig Topper
2021-07-15  2:07                   ` Wang, Pengfei
2021-07-15  6:34                     ` Hongtao Liu
2021-07-15  6:57                       ` Wang, Pengfei
2021-07-15  7:49                         ` Hongtao Liu
2021-07-21  7:43       ` [PATCH V2 00/10] " liuhongt
2021-07-21  7:43         ` [PATCH 01/10] Update hf soft-fp from glibc liuhongt
2021-07-21  7:43         ` [PATCH 02/10] [i386] Enable _Float16 type for TARGET_SSE2 and above liuhongt
2021-07-21 10:35           ` Uros Bizjak
2021-07-22  5:21             ` Hongtao Liu
2021-07-22 11:56           ` Richard Biener
2021-07-28 21:56           ` Joseph Myers
2021-07-29  4:53             ` Hongtao Liu
2021-07-29  5:34               ` Hongtao Liu
2021-07-29 21:30               ` Joseph Myers
2021-08-02  5:23                 ` Hongtao Liu
2021-08-02  6:31                   ` [PATCH V3 0/6] Initial support for AVX512FP16 liuhongt
2021-08-02  6:31                     ` [PATCH 1/6] Update hf soft-fp from glibc liuhongt
2021-08-02  6:31                     ` [PATCH 2/6] [i386] Enable _Float16 type for TARGET_SSE2 and above liuhongt
2021-08-04  2:45                       ` Hongtao Liu
2021-08-04 11:28                         ` Richard Biener
2021-08-05  7:31                           ` Hongtao Liu
2021-08-05  7:39                             ` Hongtao Liu
2021-08-05  9:24                             ` Richard Biener
2021-08-05  9:49                               ` Hongtao Liu
2021-08-05 10:14                                 ` Richard Biener
2021-08-06  3:32                                   ` [PATCH] Make sure we're playing with integral modes before call extract_integral_bit_field liuhongt
2021-08-06  3:44                                     ` Andrew Pinski
2021-08-06  4:59                                       ` Hongtao Liu
2021-08-06  5:52                                         ` Hongtao Liu
2021-08-06  6:59                                         ` Richard Biener
2021-08-06  6:57                                     ` Richard Biener
2021-08-06  9:05                                       ` Richard Sandiford
2021-08-06 11:27                                         ` Richard Biener
2021-08-09  8:34                                           ` Hongtao Liu
2021-08-17  1:52                                             ` Hongtao Liu
2021-08-24  9:40                                               ` Hongtao Liu
2021-08-24  9:44                                                 ` Hongtao Liu
2021-08-24 11:38                                                   ` Richard Biener
2021-08-26  1:17                                                     ` Hongtao Liu
2021-08-25 23:16                                                   ` Jeff Law
2021-08-26  2:05                                                     ` Hongtao Liu
2021-08-26  7:11                                                     ` Richard Biener
2021-08-26  9:06                                                       ` Richard Sandiford
2021-08-26 10:14                                                         ` Richard Biener
2021-08-26 10:50                                                           ` Richard Sandiford
2021-08-26 11:09                                                             ` Richard Biener
2021-08-27  4:56                                                               ` Hongtao Liu
2021-08-30 19:09                                                                 ` Joseph Myers
2021-08-30 21:15                                                                   ` Jeff Law
2021-08-31  6:10                                                                 ` Richard Biener
2021-08-31  6:30                                                                   ` Hongtao Liu
2021-08-31  6:48                                                                     ` Hongtao Liu
2021-08-31 11:16                                                                       ` Richard Biener
2021-08-31 11:17                                                                       ` [PATCH 0/2] Get rid of all float-int special cases in validate_subreg liuhongt
2021-08-31 11:17                                                                         ` [PATCH 1/2] Revert "Make sure we're playing with integral modes before call extract_integral_bit_field." liuhongt
2021-08-31 11:17                                                                         ` [PATCH 2/2] Get rid of all float-int special cases in validate_subreg liuhongt
2021-08-31 11:57                                                                           ` Richard Biener
2021-09-02 17:55                                                                           ` Segher Boessenkool
2021-09-03 15:05                                                                             ` Andreas Schwab
2021-09-07 23:19                                                                               ` Segher Boessenkool
2021-09-08  0:55                                                                                 ` Hongtao Liu
2021-09-03 12:42                       ` [PATCH 2/6] [i386] Enable _Float16 type for TARGET_SSE2 and above Jakub Jelinek
2021-09-06  2:05                         ` Hongtao Liu
2021-09-06 12:13                           ` Jakub Jelinek
2021-09-07  1:52                             ` Hongtao Liu
2021-09-07  7:17                               ` Jakub Jelinek
2021-09-07 10:08                                 ` Hongtao Liu
2021-09-07 10:10                                   ` Jakub Jelinek
2021-08-02  6:31                     ` [PATCH 3/6] [i386] libgcc: Enable hfmode soft-sf/df/xf/tf extensions and truncations liuhongt
2021-08-02  6:31                     ` [PATCH 4/6] Support -fexcess-precision=16 which will enable FLT_EVAL_METHOD_PROMOTE_TO_FLOAT16 when backend supports _Float16 liuhongt
2021-08-02 19:34                       ` Joseph Myers
2021-08-03  2:44                         ` Hongtao Liu
2021-08-06  6:06                           ` Hongtao Liu
2021-08-17  1:53                             ` Hongtao Liu
2021-08-24  9:39                               ` Hongtao Liu
2021-09-02  6:06                                 ` Hongtao Liu
2021-08-02  6:39                     ` [PATCH 6/6] AVX512FP16: Support vector init/broadcast/set/extract for FP16 liuhongt
2021-08-02  6:44                     ` [PATCH 5/6] AVX512FP16: Initial support for AVX512FP16 feature and scalar _Float16 instructions liuhongt
2021-08-04  2:40                       ` Hongtao Liu
2021-08-04  9:55                       ` Uros Bizjak
2021-09-02  6:06                     ` [PATCH V3 0/6] Initial support for AVX512FP16 Hongtao Liu
2021-09-02 11:30                       ` Iain Sandoe
2021-09-02 15:18                         ` Hongtao Liu
2021-09-02 16:44                           ` Iain Sandoe
2021-09-02 20:03                             ` Joseph Myers
2021-09-03  7:51                               ` Iain Sandoe
2021-09-03 15:33                                 ` Iain Sandoe
2021-09-21 20:11                                   ` Joseph Myers
2021-09-21 20:25                                     ` Iain Sandoe
2021-09-22  7:08                                     ` Iain Sandoe
2021-09-22 19:50                                       ` Joseph Myers
2021-09-02 15:30                       ` H.J. Lu
2021-09-02 15:50                         ` Hongtao Liu
2021-09-02 19:45                       ` Joseph Myers
2021-07-21  7:43         ` [PATCH 03/10] [i386] libgcc: Enable hfmode soft-sf/df/xf/tf extensions and truncations liuhongt
2021-07-21 10:51           ` Uros Bizjak
2021-07-22 12:14           ` Richard Biener
2021-07-27  5:32             ` Hongtao Liu
2021-07-29 20:57               ` Joseph Myers
2021-08-02  5:10                 ` Hongtao Liu
2021-07-21  7:43         ` [PATCH 04/10] AVX512FP16: Initial support for AVX512FP16 feature and scalar _Float16 instructions liuhongt
2021-07-22  8:49           ` Uros Bizjak
2021-07-27  7:31             ` Hongtao Liu
2021-07-21  7:43         ` [PATCH 05/10] AVX512FP16: Support vector init/broadcast/set/extract for FP16 liuhongt
2021-07-22  5:24           ` Hongtao Liu
2021-07-21  7:43         ` [PATCH 06/10] AVX512FP16: Add testcase for vector init and broadcast intrinsics liuhongt
2021-07-21  7:43         ` [PATCH 07/10] AVX512FP16: Add tests for vector passing in variable arguments liuhongt
2021-07-21  7:43         ` [PATCH 08/10] AVX512FP16: Add ABI tests for xmm liuhongt
2021-07-21  7:43         ` [PATCH 09/10] AVX512FP16: Add ABI test for ymm liuhongt
2021-07-21  7:43         ` [PATCH 10/10] AVX512FP16: Add abi test for zmm liuhongt
2021-09-08  2:54         ` [PATCH V2 00/10] Initial support for AVX512FP16 Hongtao Liu
2021-09-08  3:02           ` Hongtao Liu
2021-07-01 12:58     ` [PATCH 0/2] " Uros Bizjak
2021-07-01 21:40     ` Joseph Myers
2021-07-02  6:30   ` Hongtao Liu
2021-07-02  8:03     ` Uros Bizjak
2021-07-02  8:19       ` Richard Biener
2021-07-03 14:44         ` Hongtao Liu
2021-07-05  1:25       ` Hongtao Liu
2021-07-05 11:02         ` Richard Biener

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=CAMe9rOrvBeNmC6efzsBLVrW+PggCUGsM5fNNGKs7fiu-Ok994A@mail.gmail.com \
    --to=hjl.tools@gmail.com \
    --cc=crazylht@gmail.com \
    --cc=gcc-patches@gcc.gnu.org \
    --cc=hongtao.liu@intel.com \
    --cc=jakub@redhat.com \
    --cc=ubizjak@gmail.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).