public inbox for gcc-patches@gcc.gnu.org
 help / color / mirror / Atom feed
From: Richard Biener <rguenther@suse.de>
To: Uros Bizjak <ubizjak@gmail.com>
Cc: "gcc-patches@gcc.gnu.org" <gcc-patches@gcc.gnu.org>,
	 Jan Hubicka <hubicka@ucw.cz>,
	Hongtao Liu <hongtao.liu@intel.com>
Subject: Re: [RFC PATCH] i386: Do not sanitize upper part of V2SFmode reg with -fno-trapping-math [PR110832]
Date: Tue, 8 Aug 2023 08:07:57 +0000 (UTC)	[thread overview]
Message-ID: <nycvar.YFH.7.77.849.2308080758430.12935@jbgna.fhfr.qr> (raw)
In-Reply-To: <CAFULd4ZUXTDoAgGN_xi0tgWCm=gC5Vmd1nM+0KHa+MDPRC5V9A@mail.gmail.com>

On Mon, 7 Aug 2023, Uros Bizjak wrote:

> On Mon, Jul 31, 2023 at 11:40?AM Richard Biener <rguenther@suse.de> wrote:
> >
> > On Sun, 30 Jul 2023, Uros Bizjak wrote:
> >
> > > Also introduce -m[no-]mmxfp-with-sse option to disable trapping V2SF
> > > named patterns in order to avoid generation of partial vector V4SFmode
> > > trapping instructions.
> > >
> > > The new option is enabled by default, because even with sanitization,
> > > a small but consistent speed up of 2 to 3% with Polyhedron capacita
> > > benchmark can be achieved vs. scalar code.
> > >
> > > Using -fno-trapping-math improves Polyhedron capacita runtime 8 to 9%
> > > vs. scalar code.  This is what clang does by default, as it defaults
> > > to -fno-trapping-math.
> >
> > I like the new option, note you lack invoke.texi documentation where
> > I'd also elaborate a bit on the interaction with -fno-trapping-math
> > and the possible performance impact then NaNs or denormals leak
> > into the upper halves and cross-reference -mdaz-ftz.
> 
> The attached doc patch is invoke.texi entry for -mmmxfp-with-sse
> option. It is written in a way to also cover half-float vectors. WDYT?

"generate trapping floating-point operations"

I'd say "generate floating-point operations that might affect the
set of floating point status flags", the word "trapping" is IMHO 
misleading.
Not sure if "set of floating point status flags" is the correct term,
but it's what the C standard seems to refer to when talking about
things you get with fegetexceptflag.  feraieexcept refers to
"floating-point exceptions".  Unfortunately the -fno-trapping-math
documentation is similarly confusing (and maybe even wrong, I read
it to conform to 'non-stop' IEEE arithmetic).

I'd maybe give an example of a FP operation that's _not_ affected
by the flag (copysign?).

Otherwise it looks OK to me.

Thanks,
Richard.

  reply	other threads:[~2023-08-08  8:07 UTC|newest]

Thread overview: 8+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2023-07-30 20:12 Uros Bizjak
2023-07-31  9:40 ` Richard Biener
2023-07-31 10:13   ` Uros Bizjak
2023-08-07 15:59   ` Uros Bizjak
2023-08-08  8:07     ` Richard Biener [this message]
2023-08-08  9:06       ` Uros Bizjak
2023-08-08 10:08         ` Richard Biener
2023-08-08 11:03           ` Uros Bizjak

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=nycvar.YFH.7.77.849.2308080758430.12935@jbgna.fhfr.qr \
    --to=rguenther@suse.de \
    --cc=gcc-patches@gcc.gnu.org \
    --cc=hongtao.liu@intel.com \
    --cc=hubicka@ucw.cz \
    --cc=ubizjak@gmail.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).