public inbox for gcc-bugs@sourceware.org
help / color / mirror / Atom feed
* [Bug middle-end/103903] New: Loops handling r,g,b values are not vectorized to use power of 2 vectors even if they can
@ 2022-01-04 15:17 hubicka at gcc dot gnu.org
  2022-01-04 16:05 ` [Bug middle-end/103903] " marxin at gcc dot gnu.org
                   ` (3 more replies)
  0 siblings, 4 replies; 5+ messages in thread
From: hubicka at gcc dot gnu.org @ 2022-01-04 15:17 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=103903

            Bug ID: 103903
           Summary: Loops handling r,g,b values are not vectorized to use
                    power of 2 vectors even if they can
           Product: gcc
           Version: 12.0
            Status: UNCONFIRMED
          Severity: enhancement
          Priority: P3
         Component: middle-end
          Assignee: unassigned at gcc dot gnu.org
          Reporter: hubicka at gcc dot gnu.org
  Target Milestone: ---

This is another textcase comming from Firefox's LightPixel. I am not sure if
this is duplicate, but I think it is quite common in programs dealing with RGB
values.

To match the vectorized code we would need to move from SLP vectorizing the 3
parallel computations to vectorising the loop.

struct a {float r,g,b;};
struct a src[100000], dest[100000];

void
test ()
{
  int i;
  for (i=0;i<100000;i++)
  {
          dest[i].r/=src[i].g;
          dest[i].g/=src[i].g;
          dest[i].b/=src[i].b;
  }
}

is vectorized to do 3 operaitons at a time, while equivalent:

float src[300000], dest[300000];

void
test ()
{
  int i;
  for (i=0;i<300000;i++)
  {
          dest[i]/=src[i];
  }
}

runs faster.

^ permalink raw reply	[flat|nested] 5+ messages in thread

* [Bug middle-end/103903] Loops handling r,g,b values are not vectorized to use power of 2 vectors even if they can
  2022-01-04 15:17 [Bug middle-end/103903] New: Loops handling r,g,b values are not vectorized to use power of 2 vectors even if they can hubicka at gcc dot gnu.org
@ 2022-01-04 16:05 ` marxin at gcc dot gnu.org
  2022-01-04 19:18 ` [Bug tree-optimization/103903] " pinskia at gcc dot gnu.org
                   ` (2 subsequent siblings)
  3 siblings, 0 replies; 5+ messages in thread
From: marxin at gcc dot gnu.org @ 2022-01-04 16:05 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=103903

Martin Liška <marxin at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|UNCONFIRMED                 |NEW
                 CC|                            |marxin at gcc dot gnu.org,
                   |                            |rguenth at gcc dot gnu.org
   Last reconfirmed|                            |2022-01-04
     Ever confirmed|0                           |1

^ permalink raw reply	[flat|nested] 5+ messages in thread

* [Bug tree-optimization/103903] Loops handling r,g,b values are not vectorized to use power of 2 vectors even if they can
  2022-01-04 15:17 [Bug middle-end/103903] New: Loops handling r,g,b values are not vectorized to use power of 2 vectors even if they can hubicka at gcc dot gnu.org
  2022-01-04 16:05 ` [Bug middle-end/103903] " marxin at gcc dot gnu.org
@ 2022-01-04 19:18 ` pinskia at gcc dot gnu.org
  2022-01-05  7:43 ` rguenth at gcc dot gnu.org
  2022-01-05  9:49 ` hubicka at gcc dot gnu.org
  3 siblings, 0 replies; 5+ messages in thread
From: pinskia at gcc dot gnu.org @ 2022-01-04 19:18 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=103903

Andrew Pinski <pinskia at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
           See Also|                            |https://gcc.gnu.org/bugzill
                   |                            |a/show_bug.cgi?id=99412

--- Comment #1 from Andrew Pinski <pinskia at gcc dot gnu.org> ---
Basically this is re-rolling.

PR 99412 is another example of re-rolling; there might be others.

^ permalink raw reply	[flat|nested] 5+ messages in thread

* [Bug tree-optimization/103903] Loops handling r,g,b values are not vectorized to use power of 2 vectors even if they can
  2022-01-04 15:17 [Bug middle-end/103903] New: Loops handling r,g,b values are not vectorized to use power of 2 vectors even if they can hubicka at gcc dot gnu.org
  2022-01-04 16:05 ` [Bug middle-end/103903] " marxin at gcc dot gnu.org
  2022-01-04 19:18 ` [Bug tree-optimization/103903] " pinskia at gcc dot gnu.org
@ 2022-01-05  7:43 ` rguenth at gcc dot gnu.org
  2022-01-05  9:49 ` hubicka at gcc dot gnu.org
  3 siblings, 0 replies; 5+ messages in thread
From: rguenth at gcc dot gnu.org @ 2022-01-05  7:43 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=103903

--- Comment #2 from Richard Biener <rguenth at gcc dot gnu.org> ---
If you fix the loop to do

  for (i=0;i<100000;i++)
  {
          dest[i].r/=src[i].g;
          dest[i].g/=src[i].g;
          dest[i].b/=src[i].b;
  }

it's vectorized just fine (with larger than necessary VF):

.L2:
        movaps  dest+16(%rax), %xmm1
        movaps  dest+32(%rax), %xmm0
        addq    $48, %rax
        divps   src-32(%rax), %xmm1
        movaps  dest-48(%rax), %xmm2
        divps   src-16(%rax), %xmm0
        divps   src-48(%rax), %xmm2
        movaps  %xmm1, dest-32(%rax)
        movaps  %xmm2, dest-48(%rax)
        movaps  %xmm0, dest-16(%rax)
        cmpq    $1200000, %rax
        jne     .L2

so not sure what you are asking for?  Is the unrolling harmful?  It should
be doable to do the "re-rolling" on the fly in some cases but it might be
some work to tie that in.

^ permalink raw reply	[flat|nested] 5+ messages in thread

* [Bug tree-optimization/103903] Loops handling r,g,b values are not vectorized to use power of 2 vectors even if they can
  2022-01-04 15:17 [Bug middle-end/103903] New: Loops handling r,g,b values are not vectorized to use power of 2 vectors even if they can hubicka at gcc dot gnu.org
                   ` (2 preceding siblings ...)
  2022-01-05  7:43 ` rguenth at gcc dot gnu.org
@ 2022-01-05  9:49 ` hubicka at gcc dot gnu.org
  3 siblings, 0 replies; 5+ messages in thread
From: hubicka at gcc dot gnu.org @ 2022-01-05  9:49 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=103903

Jan Hubicka <hubicka at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
         Resolution|---                         |INVALID
             Status|NEW                         |RESOLVED

--- Comment #3 from Jan Hubicka <hubicka at gcc dot gnu.org> ---
Aha, sorry. I did not spot the typo in cut&paste.  Unrolling is fine.  I need
to figure out why in the real testcase we don't do the same transformation.

^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2022-01-05  9:49 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2022-01-04 15:17 [Bug middle-end/103903] New: Loops handling r,g,b values are not vectorized to use power of 2 vectors even if they can hubicka at gcc dot gnu.org
2022-01-04 16:05 ` [Bug middle-end/103903] " marxin at gcc dot gnu.org
2022-01-04 19:18 ` [Bug tree-optimization/103903] " pinskia at gcc dot gnu.org
2022-01-05  7:43 ` rguenth at gcc dot gnu.org
2022-01-05  9:49 ` hubicka at gcc dot gnu.org

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).