public inbox for gcc-bugs@sourceware.org
help / color / mirror / Atom feed
* [Bug tree-optimization/36281]  New: vectorized code not parallelized
@ 2008-05-20 19:26 spop at gcc dot gnu dot org
  2008-05-20 19:42 ` [Bug tree-optimization/36281] " pinskia at gcc dot gnu dot org
                   ` (2 more replies)
  0 siblings, 3 replies; 4+ messages in thread
From: spop at gcc dot gnu dot org @ 2008-05-20 19:26 UTC (permalink / raw)
  To: gcc-bugs

The testcase of PR36181 should be parallelized after being vectorized.

/* { dg-do compile } */
/* { dg-options "-O3 -ftree-parallelize-loops=2" } */

int foo ()
{
  int i, sum = 0, data[1024];

  for(i = 0; i<1024; i++)
    sum += data[i];

  return sum;
}

The fix for PR36181 was to disable the parallelization of a loop when
one of the phi nodes had a vector type.  This testcase should also be
parallelized.  See also the comments from the fix for PR36181:
http://gcc.gnu.org/ml/gcc-patches/2008-05/msg01217.html


-- 
           Summary: vectorized code not parallelized
           Product: gcc
           Version: 4.4.0
            Status: UNCONFIRMED
          Severity: normal
          Priority: P3
         Component: tree-optimization
        AssignedTo: spop at gcc dot gnu dot org
        ReportedBy: spop at gcc dot gnu dot org


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=36281


^ permalink raw reply	[flat|nested] 4+ messages in thread

* [Bug tree-optimization/36281] vectorized code not parallelized
  2008-05-20 19:26 [Bug tree-optimization/36281] New: vectorized code not parallelized spop at gcc dot gnu dot org
@ 2008-05-20 19:42 ` pinskia at gcc dot gnu dot org
  2008-12-28  3:26 ` [Bug tree-optimization/36281] vector code is " pinskia at gcc dot gnu dot org
  2010-07-19  8:25 ` rob1weld at aol dot com
  2 siblings, 0 replies; 4+ messages in thread
From: pinskia at gcc dot gnu dot org @ 2008-05-20 19:42 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #1 from pinskia at gcc dot gnu dot org  2008-05-20 19:42 -------
Even worse:
#define vector __attribute__((vector_size(16) ))
vector int foo ()
{
  vector int i, sum = 0, data[1024];

  for(i = 0; i<1024; i++)
    sum += data[i];

  return sum;
}

With -O2 -ftree-parallelize-loops=2, this does not get parallelized at all even
though we did not run the vectorizer.


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=36281


^ permalink raw reply	[flat|nested] 4+ messages in thread

* [Bug tree-optimization/36281] vector code is not parallelized
  2008-05-20 19:26 [Bug tree-optimization/36281] New: vectorized code not parallelized spop at gcc dot gnu dot org
  2008-05-20 19:42 ` [Bug tree-optimization/36281] " pinskia at gcc dot gnu dot org
@ 2008-12-28  3:26 ` pinskia at gcc dot gnu dot org
  2010-07-19  8:25 ` rob1weld at aol dot com
  2 siblings, 0 replies; 4+ messages in thread
From: pinskia at gcc dot gnu dot org @ 2008-12-28  3:26 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #2 from pinskia at gcc dot gnu dot org  2008-12-28 03:23 -------
Confirmed.


-- 

pinskia at gcc dot gnu dot org changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |pinskia at gcc dot gnu dot
                   |                            |org
           Severity|normal                      |enhancement
             Status|UNCONFIRMED                 |NEW
     Ever Confirmed|0                           |1
           Keywords|                            |missed-optimization
   Last reconfirmed|0000-00-00 00:00:00         |2008-12-28 03:23:49
               date|                            |
            Summary|vectorized code not         |vector code is not
                   |parallelized                |parallelized


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=36281


^ permalink raw reply	[flat|nested] 4+ messages in thread

* [Bug tree-optimization/36281] vector code is not parallelized
  2008-05-20 19:26 [Bug tree-optimization/36281] New: vectorized code not parallelized spop at gcc dot gnu dot org
  2008-05-20 19:42 ` [Bug tree-optimization/36281] " pinskia at gcc dot gnu dot org
  2008-12-28  3:26 ` [Bug tree-optimization/36281] vector code is " pinskia at gcc dot gnu dot org
@ 2010-07-19  8:25 ` rob1weld at aol dot com
  2 siblings, 0 replies; 4+ messages in thread
From: rob1weld at aol dot com @ 2010-07-19  8:25 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #3 from rob1weld at aol dot com  2010-07-19 08:25 -------
> ... this does not get parallelized at all ...
Also see 34501

Perhaps we could make some use of Pluto. It is a fully automatic (C to OpenMP
C) parallelizer that makes code amenable to auto-vectorization.

http://pluto-compiler.sourceforge.net/


Also see these Parallelizers:
http://cri.ensmp.fr/pips/ or http://pips4u.org/
There was something I found a few days ago from here that I can no longer
locate
http://en.wikipedia.org/wiki/Automatic_parallelization

It would be great to take that inner loop (if it were much larger) and
'Kernelize' it for co-processing on our Graphics Card. We could expand GCCs
'x-parallelize-x' and threading options to automatically find the sweeter spots
to offload for co=processing (on a GPU, using OpenCL).

Barra - NVIDIA G80 GPU Functional Simulator
http://gpgpu.univ-perp.fr/index.php/Barra

If we were 'allowed' to call a post-processor (like LTO used to do) we could
call ATI's GPU SDK which supports OpenCL and outputs code BOTH to x86 and it's
own GPUs. 


Commercial Projects:
Auto-parallelizer and SIMDinator by Dalsoft
http://www.dalsoft.com/documentation_simdinator.html

NVidia's PTX
http://en.wikipedia.org/wiki/Parallel_Thread_Execution

Cray's work with LLVM
http://llvm.org/devmtg/2009-10/Greene_180k_Cores.pdf

Larrabee
http://www.drdobbs.com/architecture-and-design/216402188?pgno=5


Rob


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=36281


^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2010-07-19  8:25 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2008-05-20 19:26 [Bug tree-optimization/36281] New: vectorized code not parallelized spop at gcc dot gnu dot org
2008-05-20 19:42 ` [Bug tree-optimization/36281] " pinskia at gcc dot gnu dot org
2008-12-28  3:26 ` [Bug tree-optimization/36281] vector code is " pinskia at gcc dot gnu dot org
2010-07-19  8:25 ` rob1weld at aol dot com

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).