From: Vidya Praveen <vidyapraveen@arm.com>
To: Richard Biener <rguenther@suse.de>
Cc: "gcc@gcc.gnu.org" <gcc@gcc.gnu.org>, "ook@ucw.cz" <ook@ucw.cz>
Subject: Re: [RFC] Vectorization of indexed elements
Date: Mon, 30 Sep 2013 14:00:00 -0000 [thread overview]
Message-ID: <20130930140001.GF3460@e103625-lin.cambridge.arm.com> (raw)
In-Reply-To: <alpine.LNX.2.00.1309301504120.5759@zhemvz.fhfr.qr>
On Mon, Sep 30, 2013 at 02:19:32PM +0100, Richard Biener wrote:
> On Mon, 30 Sep 2013, Vidya Praveen wrote:
>
> > On Fri, Sep 27, 2013 at 04:19:45PM +0100, Vidya Praveen wrote:
> > > On Fri, Sep 27, 2013 at 03:50:08PM +0100, Vidya Praveen wrote:
> > > [...]
> > > > > > I can't really insist on the single lane load.. something like:
> > > > > >
> > > > > > vc:V4SI[0] = c
> > > > > > vt:V4SI = vec_duplicate:V4SI (vec_select:SI vc:V4SI 0)
> > > > > > va:V4SI = vb:V4SI <op> vt:V4SI
> > > > > >
> > > > > > Or is there any other way to do this?
> > > > >
> > > > > Can you elaborate on "I can't really insist on the single lane load"?
> > > > > What's the single lane load in your example?
> > > >
> > > > Loading just one lane of the vector like this:
> > > >
> > > > vc:V4SI[0] = c // from the above scalar example
> > > >
> > > > or
> > > >
> > > > vc:V4SI[0] = c[2]
> > > >
> > > > is what I meant by single lane load. In this example:
> > > >
> > > > t = c[2]
> > > > ...
> > > > vb:v4si = b[0:3]
> > > > vc:v4si = { t, t, t, t }
> > > > va:v4si = vb:v4si <op> vc:v4si
> > > >
> > > > If we are expanding the CONSTRUCTOR as vec_duplicate at vec_init, I cannot
> > > > insist 't' to be vector and t = c[2] to be vect_t[0] = c[2] (which could be
> > > > seen as vec_select:SI (vect_t 0) ).
> > > >
> > > > > I'd expect the instruction
> > > > > pattern as quoted to just work (and I hope we expand an uniform
> > > > > constructor { a, a, a, a } properly using vec_duplicate).
> > > >
> > > > As much as I went through the code, this is only done using vect_init. It is
> > > > not expanded as vec_duplicate from, for example, store_constructor() of expr.c
> > >
> > > Do you see any issues if we expand such constructor as vec_duplicate directly
> > > instead of going through vect_init way?
> >
> > Sorry, that was a bad question.
> >
> > But here's what I would like to propose as a first step. Please tell me if this
> > is acceptable or if it makes sense:
> >
> > - Introduce standard pattern names
> >
> > "vmulim4" - vector muliply with second operand as indexed operand
> >
> > Example:
> >
> > (define_insn "vmuliv4si4"
> > [set (match_operand:V4SI 0 "register_operand")
> > (mul:V4SI (match_operand:V4SI 1 "register_operand")
> > (vec_duplicate:V4SI
> > (vec_select:SI
> > (match_operand:V4SI 2 "register_operand")
> > (match_operand:V4SI 3 "immediate_operand)))))]
> > ...
> > )
>
> We could factor this with providing a standard pattern name for
>
> (define_insn "vdupi<mode>"
> [set (match_operand:<mode> 0 "register_operand")
> (vec_duplicate:<mode>
> (vec_select:<scalarmode>
> (match_operand:<mode> 1 "register_operand")
> (match_operand:SI 2 "immediate_operand))))]
This is good. I did think about this but then I thought of avoiding the need
for combiner patterns :-)
But do you find the lane specific mov pattern I proposed, acceptable?
> (you use V4SI for the immediate?
Sorry typo again!! It should've been SI.
> Ideally vdupi has another custom
> mode for the vector index).
>
> Note that this factored pattern is already available as vec_perm_const!
> It is simply (vec_perm_const:V4SI <source> <source> <immediate-selector>).
>
> Which means that on the GIMPLE level we should try to combine
>
> el_4 = BIT_FIELD_REF <v_3, ...>;
> v_5 = { el_4, el_4, ... };
I don't think we reach this state at all for the scenarios in discussion.
what we generally have is:
el_4 = MEM_REF < array + index*size >
v_5 = { el_4, ... }
Or am I missing something?
>
> into
>
> v_5 = VEC_PERM_EXPR <v_3, v_3, ...>;
>
> which it should already do with simplify_permutation.
>
> But I'm not sure what you are after at then end ;)
>
> Richard.
>
Regards
VP
next prev parent reply other threads:[~2013-09-30 14:00 UTC|newest]
Thread overview: 20+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-09-09 17:25 Vidya Praveen
2013-09-09 18:02 ` Marc Glisse
2013-09-10 8:25 ` Richard Biener
2013-09-24 15:03 ` Vidya Praveen
2013-09-25 9:22 ` Richard Biener
2013-09-30 13:01 ` Vidya Praveen
2013-09-24 15:04 ` Vidya Praveen
2013-09-25 9:25 ` Richard Biener
2013-09-27 14:50 ` Vidya Praveen
2013-09-27 15:19 ` Vidya Praveen
2013-09-30 12:55 ` Vidya Praveen
2013-09-30 13:19 ` Richard Biener
2013-09-30 14:00 ` Vidya Praveen [this message]
2013-10-01 8:26 ` Richard Biener
2013-10-11 14:54 ` Vidya Praveen
2013-10-11 15:05 ` Jakub Jelinek
2013-12-04 17:07 ` Vidya Praveen
2013-10-14 8:05 ` Richard Biener
2013-12-04 16:10 ` Vidya Praveen
2013-12-06 11:48 ` Richard Biener
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20130930140001.GF3460@e103625-lin.cambridge.arm.com \
--to=vidyapraveen@arm.com \
--cc=gcc@gcc.gnu.org \
--cc=ook@ucw.cz \
--cc=rguenther@suse.de \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).