Re: [PATCH] middle-end: Call negate_rtx instead of simplify_gen_unary expanding rotate shift [PR113033]

public inbox for gcc-patches@gcc.gnu.org
 help / color / mirror / Atom feed

From: Jakub Jelinek <jakub@redhat.com>
To: Xi Ruoyao <xry111@xry111.site>
Cc: Jeff Law <jeffreyalaw@gmail.com>,
	gcc-patches@gcc.gnu.org, chenglulu <chenglulu@loongson.cn>,
	i@xen0n.name, xuchenghua@loongson.cn, c@jia.je
Subject: Re: [PATCH] middle-end: Call negate_rtx instead of simplify_gen_unary expanding rotate shift [PR113033]
Date: Mon, 18 Dec 2023 21:15:51 +0100	[thread overview]
Message-ID: <ZYCod5F4cTUVSfqu@tucnak> (raw)
In-Reply-To: <ab5d92793e6a3561f758bddffebaa7c6b4076be5.camel@xry111.site>

On Tue, Dec 19, 2023 at 04:01:52AM +0800, Xi Ruoyao wrote:
> On Mon, 2023-12-18 at 18:45 +0100, Jakub Jelinek wrote:
> > On Tue, Dec 19, 2023 at 12:48:46AM +0800, Xi Ruoyao wrote:
> > > > > gcc/ChangeLog:
> > > > > 
> > > > > 	PR middle-end/113033
> > > > > 	* expmed.cc (expand_shift_1): When expanding rotate shift, call
> > > > > 	negate_rtx instead of simplify_gen_unary (NEG, ...).
> > > 
> > > > The key difference being that using negate_rtx will go through the
> > > > expander which knows how to synthesize negation whereas 
> > > > simplify_gen_unary will just generate a (neg ...) and assume it matches 
> > > > something in the backend, right?
> > > 
> > > For PR113033 the key difference (to me) is negate_rtx emits an insn to
> > > set a new pseudo reg to -x.  So the result will be
> > > 
> > > (set (reg:SI 81) (neg:SI (reg:SI 80)))
> > > 
> > > then
> > > 
> > > (and (reg:SI 81) (const_int 31))
> > > 
> > > instead of a consolidated
> > > 
> > > (and:SI (neg:SI (reg:SI IN)) (const_int 63))
> > > 
> > > AFAIK no backends have an instruction doing "negate an operand then and
> > > bitwisely".
> > 
> > Can you explain why it doesn't work as is though?
> > I mean, expand_shift_1 with that (and (neg (reg ...)) (const_int ...))
> > should try to legitimize the operand (e.g. in maybe_legitimize_operand
> > -> force_operand and force_operand should be able to deal with that,
> > AND is binary op, so it recurses on the 2 operands and NEG is UNARY_P,
> > so the recursion should deal with that if it is not general_operand.
> 
> It happens with vector left rotate:
> 
> V test (V a, int x)
> {
>   int _1; 
>   V _4; 
> 
>   <bb 2> [local count: 1073741824]:
>   _1 = x_2(D) & 31; 
>   _4 = a_3(D) r<< _1; 
>   return _4; 
> 
> }
> 
> Here V is in V4SImode.  With other_amount = (and (neg (reg 85))
> (const_int 31)), we end up calling
> 
> expand_shift_1 (
>   code = RSHIFT_EXPR,
>   mode = V4SImode,
>   shifted = (reg:V4SI 82),
>   amount = (and:SI (neg:SI (reg:SI 85)) (const_int 31)),
>   target = (reg:V4SI 84),
>   unsignedp = true,
>   may_fail = false)
> 
> It then calls
> 
> expand_binop (
>   mode = V4SImode,
>   lshr_optab,
>   op0 = (reg:V4SI 82),
>   op1 = (and:SI (neg:SI (reg:SI 85)) (const_int 31)),
>   target = (reg:V4SI 84),
>   unsignedp=1,
>   methods=OPTAB_DIRECT)
> 
> In expand_binop:
> 
> rtx vop1 = expand_vector_broadcast (mode, op1);
> 
> LoongArch backend don't have vec_duplicate (well, broadcasting is
> implemented as a special case of vec_init and maybe this is not so
> good...), so finally we get:
> 
>   vec = rtvec_alloc (n); 
>   for (int i = 0; i < n; ++i) 
>     RTVEC_ELT (vec, i) = op;
>   rtx ret = gen_reg_rtx (vmode);
>   emit_insn (GEN_FCN (icode) (ret, gen_rtx_PARALLEL (vmode, vec)));
> 
> here "op" is (and:SI (neg:SI (reg:SI 85)) (const_int 31)), thus it
> evaded expansion :(.

Then that seems like a bug in the loongarch vec_init pattern(s).
Those really don't have a predicate in any of the backends on the input
operand, so they need to force_reg it if it is something it can't handle.
I've looked e.g. at i386 vec_init and that is exactly what it does,
see the various tests + force_reg calls in ix86_expand_vector_init*.

	Jakub

next prev parent reply	other threads:[~2023-12-18 20:16 UTC|newest]

Thread overview: 7+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2023-12-18 13:42 Xi Ruoyao
2023-12-18 15:39 ` Jeff Law
2023-12-18 16:48   ` Xi Ruoyao
2023-12-18 17:45     ` Jakub Jelinek
2023-12-18 20:01       ` Xi Ruoyao
2023-12-18 20:15         ` Jakub Jelinek [this message]
2023-12-18 20:18           ` Xi Ruoyao

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=ZYCod5F4cTUVSfqu@tucnak \
    --to=jakub@redhat.com \
    --cc=c@jia.je \
    --cc=chenglulu@loongson.cn \
    --cc=gcc-patches@gcc.gnu.org \
    --cc=i@xen0n.name \
    --cc=jeffreyalaw@gmail.com \
    --cc=xry111@xry111.site \
    --cc=xuchenghua@loongson.cn \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).