public inbox for gcc-patches@gcc.gnu.org
 help / color / mirror / Atom feed
From: Richard Biener <rguenther@suse.de>
To: Hongtao Liu <crazylht@gmail.com>
Cc: gcc-patches@gcc.gnu.org
Subject: Re: [PATCH][RFC] middle-end/110237 - wrong MEM_ATTRs for partial loads/stores
Date: Mon, 26 Jun 2023 08:41:59 +0000 (UTC)	[thread overview]
Message-ID: <nycvar.YFH.7.77.849.2306260836480.4723@jbgna.fhfr.qr> (raw)
In-Reply-To: <CAMZc-bzqryvft34cJsTYV+JxB1Cr4B53_Aqa_hevURc897K14g@mail.gmail.com>

On Mon, 26 Jun 2023, Hongtao Liu wrote:

> On Wed, Jun 21, 2023 at 3:49?PM Richard Biener via Gcc-patches
> <gcc-patches@gcc.gnu.org> wrote:
> >
> > The following addresses a miscompilation by RTL scheduling related
> > to the representation of masked stores.  For that we have
> >
> > (insn 38 35 39 3 (set (mem:V16SI (plus:DI (reg:DI 40 r12 [orig:90 _22 ] [90])
> >                 (const:DI (plus:DI (symbol_ref:DI ("b") [flags 0x2]  <var_decl 0x7ffff6e28d80 b>)
> >                         (const_int -4 [0xfffffffffffffffc])))) [1 MEM <vector(16) int> [(int *)vectp_b.12_28]+0 S64 A32])
> >         (vec_merge:V16SI (reg:V16SI 20 xmm0 [118])
> >             (mem:V16SI (plus:DI (reg:DI 40 r12 [orig:90 _22 ] [90])
> >                     (const:DI (plus:DI (symbol_ref:DI ("b") [flags 0x2]  <var_decl 0x7ffff6e28d80 b>)
> >                             (const_int -4 [0xfffffffffffffffc])))) [1 MEM <vector(16) int> [(int *)vectp_b.12_28]+0 S64 A32])
> >
> > and specifically the memory attributes
> >
> >   [1 MEM <vector(16) int> [(int *)vectp_b.12_28]+0 S64 A32]
> >
> > are problematic.  They tell us the instruction stores and reads a full
> > vector which it if course does not.  There isn't any good MEM_EXPR
> > we can use here (we lack a way to just specify a pointer and restrict
> > info for example), and since the MEMs have a vector mode it's
> > difficult in general as passes do not need to look at the memory
> > attributes at all.
> >
> > The easiest way to avoid running into the alias analysis problem is
> > to scrap the MEM_EXPR when we expand the internal functions for
> > partial loads/stores.  That avoids the disambiguation we run into
> > which is realizing that we store to an object of less size as
> > the size of the mode we appear to store.
> >
> > After the patch we see just
> >
> >   [1  S64 A32]
> >
> > so we preserve the alias set, the alignment and the size (the size
> > is redundant if the MEM insn't BLKmode).  That's still not good
> > in case the RTL alias oracle would implement the same
> > disambiguation but it fends off the gimple one.
> >
> > This fixes gcc.dg/torture/pr58955-2.c when built with AVX512
> > and --param=vect-partial-vector-usage=1.
> >
> > On the MEM_EXPR side we could use a CALL_EXPR and on the RTL
> > side we might instead want to use a BLKmode MEM?  Any better
> > ideas here?
> Can we introduce a new member in class mem_attrs and ao_ref, initial
> the member (named partial_access_p) in expand_partial_load_optab_fn
> and expand_partial_store_optab_fn, pass partial_access_p from
> mem_attrs to ao_ref in djust ao_ref_from_mem.
> It looks to me we only want to avoid the below rule in alias analysis.
> For others, size, max_size, offset is still meaningful, even for
> rtx_addr_can_trap_p, if size can't trap, partial access must not trap?

The GIMPLE oracle makes sure to set 'size' to -1 (not known) when
seeing the IFNs for masked stores/loads.  To fix the RTL MEM_EXPR
side I would rather try putting a CALL_EXPR there, preserving the
masked internal function, instead of using a mem-ref and additional
info.

That leaves the MEM RTX itself - for a MEM with non-BLKmode I thin
MEM_ATTRS are completely optional and it's OK to drop them, we'd
have to special case the partial MEM RTX then.  Jeff agreed that
eventually using BLKmode for them would be "OK", then we can
make MEM_SIZE unknown as well.

I think the issue is latent on branches so I first wanted to find
some minimal change to mitigate the miscompile and then maybe try
options to not lose most of the alias info here.

Richard.

>   /* If the pointer based access is bigger than the variable they cannot
>      alias.  This is similar to the check below where we use TBAA to
>      increase the size of the pointer based access based on the dynamic
>      type of a containing object we can infer from it.  */
>   poly_int64 dsize2;
>   if (known_size_p (size1) --- should be unknown??
>       && poly_int_tree_p (DECL_SIZE (base2), &dsize2)
>       && known_lt (dsize2, size1))
>     return false;
> 
> >
> > Thanks,
> > Richard.
> >
> >         PR middle-end/110237
> >         * internal-fn.cc (expand_partial_load_optab_fn): Clear
> >         MEM_EXPR and MEM_OFFSET.
> >         (expand_partial_store_optab_fn): Likewise.
> > ---
> >  gcc/internal-fn.cc | 8 ++++++++
> >  1 file changed, 8 insertions(+)
> >
> > diff --git a/gcc/internal-fn.cc b/gcc/internal-fn.cc
> > index c911ae790cb..2dc685e7d85 100644
> > --- a/gcc/internal-fn.cc
> > +++ b/gcc/internal-fn.cc
> > @@ -2903,6 +2903,10 @@ expand_partial_load_optab_fn (internal_fn, gcall *stmt, convert_optab optab)
> >
> >    mem = expand_expr (rhs, NULL_RTX, VOIDmode, EXPAND_WRITE);
> >    gcc_assert (MEM_P (mem));
> > +  /* The built MEM_REF does not accurately reflect that the load
> > +     is only partial.  Clear it.  */
> > +  set_mem_expr (mem, NULL_TREE);
> > +  clear_mem_offset (mem);
> >    mask = expand_normal (maskt);
> >    target = expand_expr (lhs, NULL_RTX, VOIDmode, EXPAND_WRITE);
> >    create_output_operand (&ops[0], target, TYPE_MODE (type));
> > @@ -2971,6 +2975,10 @@ expand_partial_store_optab_fn (internal_fn, gcall *stmt, convert_optab optab)
> >
> >    mem = expand_expr (lhs, NULL_RTX, VOIDmode, EXPAND_WRITE);
> >    gcc_assert (MEM_P (mem));
> > +  /* The built MEM_REF does not accurately reflect that the store
> > +     is only partial.  Clear it.  */
> > +  set_mem_expr (mem, NULL_TREE);
> > +  clear_mem_offset (mem);
> >    mask = expand_normal (maskt);
> >    reg = expand_normal (rhs);
> >    create_fixed_operand (&ops[0], mem);
> > --
> > 2.35.3
> 
> 
> 
> 

-- 
Richard Biener <rguenther@suse.de>
SUSE Software Solutions Germany GmbH, Frankenstrasse 146, 90461 Nuernberg,
Germany; GF: Ivo Totev, Andrew Myers, Andrew McDonald, Boudien Moerman;
HRB 36809 (AG Nuernberg)

  reply	other threads:[~2023-06-26  8:42 UTC|newest]

Thread overview: 15+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <20230621074956.1174B3858288@sourceware.org>
2023-06-26  8:29 ` Hongtao Liu
2023-06-26  8:41   ` Richard Biener [this message]
     [not found] <20230621075019.7CA813858033@sourceware.org>
2023-11-27 12:39 ` Robin Dapp
2023-11-27 15:45   ` Jeff Law
2023-11-28  7:50     ` Richard Biener
2023-11-28 10:31       ` Richard Sandiford
2023-11-28 11:21         ` Richard Biener
2023-11-28 11:32           ` Richard Sandiford
2023-11-28 12:17             ` Richard Biener
2023-11-28 15:00       ` Jeff Law
2023-11-29  7:16         ` Richard Biener
     [not found] <20230621074951.F3C3C3858433@sourceware.org>
2023-06-21 15:29 ` Jeff Law
2023-06-22  6:39   ` Richard Biener
2023-06-24 14:32     ` Jeff Law
2023-06-21  7:49 Richard Biener

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=nycvar.YFH.7.77.849.2306260836480.4723@jbgna.fhfr.qr \
    --to=rguenther@suse.de \
    --cc=crazylht@gmail.com \
    --cc=gcc-patches@gcc.gnu.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).