public inbox for gcc-bugs@sourceware.org
help / color / mirror / Atom feed
* [Bug c/112399] New: RISC-V: Missed AVL propagation for complicate reduction case
@ 2023-11-06  1:40 juzhe.zhong at rivai dot ai
  2023-11-07  7:02 ` [Bug target/112399] " cvs-commit at gcc dot gnu.org
                   ` (2 more replies)
  0 siblings, 3 replies; 4+ messages in thread
From: juzhe.zhong at rivai dot ai @ 2023-11-06  1:40 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112399

            Bug ID: 112399
           Summary: RISC-V: Missed AVL propagation for complicate
                    reduction case
           Product: gcc
           Version: 14.0
            Status: UNCONFIRMED
          Severity: normal
          Priority: P3
         Component: c
          Assignee: unassigned at gcc dot gnu.org
          Reporter: juzhe.zhong at rivai dot ai
  Target Milestone: ---

double foo (double *__restrict a, 
double *__restrict b, 
double *__restrict c,
int n)
{
  double result = 0;
  for (int i = 0; i < n; i++)
    result += a[i] * b[i] * c[i];
  return result;
}

https://godbolt.org/z/znqcf7ehz

        vsetvli a5,a3,e8,mf8,ta,ma   -----> should be change into e64m1 Tuma
        slli    a4,a5,3
        vle64.v v4,0(a0)
        vle64.v v1,0(a1)
        vle64.v v3,0(a2)
        sub     a3,a3,a5
        vsetvli a6,zero,e64,m1,ta,ma   ---> redundant 
        add     a0,a0,a4
        vfmul.vv        v1,v1,v4
        add     a1,a1,a4
        vsetvli zero,a5,e64,m1,tu,ma   ---> redundant 
        add     a2,a2,a4
        vfmacc.vv       v2,v3,v1
        bne     a3,zero,.L3
        fmv.d.x fa5,zero
        vsetvli a6,zero,e64,m1,ta,ma
        vfmv.s.f        v1,fa5
        vfredusum.vs    v2,v2,v1
        vfmv.f.s        fa0,v2
        ret

^ permalink raw reply	[flat|nested] 4+ messages in thread

* [Bug target/112399] RISC-V: Missed AVL propagation for complicate reduction case
  2023-11-06  1:40 [Bug c/112399] New: RISC-V: Missed AVL propagation for complicate reduction case juzhe.zhong at rivai dot ai
@ 2023-11-07  7:02 ` cvs-commit at gcc dot gnu.org
  2023-11-07  7:03 ` juzhe.zhong at rivai dot ai
  2023-11-12 21:17 ` pinskia at gcc dot gnu.org
  2 siblings, 0 replies; 4+ messages in thread
From: cvs-commit at gcc dot gnu.org @ 2023-11-07  7:02 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112399

--- Comment #1 from CVS Commits <cvs-commit at gcc dot gnu.org> ---
The master branch has been updated by Pan Li <panli@gcc.gnu.org>:

https://gcc.gnu.org/g:f1e084c6c3ef1d1233e35823dacfdf9cee722430

commit r14-5179-gf1e084c6c3ef1d1233e35823dacfdf9cee722430
Author: Juzhe-Zhong <juzhe.zhong@rivai.ai>
Date:   Mon Nov 6 11:34:26 2023 +0800

    RISC-V: Enhance AVL propagation for complicate reduction auto-vectorization

    I notice we failed to AVL propagate for reduction with more complicate
situation:

    double foo (double *__restrict a,
    double *__restrict b,
    double *__restrict c,
    int n)
    {
      double result = 0;
      for (int i = 0; i < n; i++)
        result += a[i] * b[i] * c[i];
      return result;
    }

            vsetvli a5,a3,e8,mf8,ta,ma           -> should be fused into
e64m1,TU
            slli    a4,a5,3
            vle64.v v3,0(a0)
            vle64.v v1,0(a1)
            vsetvli a6,zero,e64,m1,ta,ma         -> redundant
            vfmul.vv        v1,v1,v3
            vsetvli zero,a5,e64,m1,tu,ma         -> redundant
            vle64.v v3,0(a2)
            vfmacc.vv       v2,v1,v3
            add     a0,a0,a4
            add     a1,a1,a4
            add     a2,a2,a4
            sub     a3,a3,a5
            bne     a3,zero,.L3

    The failed AVL propgation causes redundant AVL/VL togglling.
    The root cause as follows:

    vsetvl a5, zero
    vadd.vv def r136
    vsetvl zero, a3, ... TU
    vsub.vv (use r136)

    We propagate AVL (r136) from 'vsub.vv' into 'vadd.vv' when 'vsub.vv' is TA
policy.
    However, it's too restrict so we missed optimization here. We enhance AVL
propation
    for TU policy for following situation:

    vsetvl a5, zero
    vadd.vv def r136
    vsetvl zero, a3, ... TU
    vsub.vv (use r136, merge != r136)

    Note that we should only propagate AVL when merge != r136 for 'vsub.vv'
doesn't
    depend on the tail elements.
    After this patch:

            vsetvli a5,a3,e64,m1,tu,ma
            slli    a4,a5,3
            vle64.v v3,0(a0)
            vle64.v v1,0(a1)
            vfmul.vv        v1,v1,v3
            vle64.v v3,0(a2)
            vfmacc.vv       v2,v3,v1
            add     a0,a0,a4
            add     a1,a1,a4
            add     a2,a2,a4
            sub     a3,a3,a5
            bne     a3,zero,.L3

            PR target/112399

    gcc/ChangeLog:

            * config/riscv/riscv-avlprop.cc
            (pass_avlprop::get_vlmax_ta_preferred_avl): Enhance AVL
propagation.
            * config/riscv/t-riscv: Add new include.

    gcc/testsuite/ChangeLog:

            * gcc.target/riscv/rvv/vsetvl/imm_switch-2.c: Adapt test.
            * gcc.target/riscv/rvv/autovec/pr112399.c: New test.

^ permalink raw reply	[flat|nested] 4+ messages in thread

* [Bug target/112399] RISC-V: Missed AVL propagation for complicate reduction case
  2023-11-06  1:40 [Bug c/112399] New: RISC-V: Missed AVL propagation for complicate reduction case juzhe.zhong at rivai dot ai
  2023-11-07  7:02 ` [Bug target/112399] " cvs-commit at gcc dot gnu.org
@ 2023-11-07  7:03 ` juzhe.zhong at rivai dot ai
  2023-11-12 21:17 ` pinskia at gcc dot gnu.org
  2 siblings, 0 replies; 4+ messages in thread
From: juzhe.zhong at rivai dot ai @ 2023-11-07  7:03 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112399

JuzheZhong <juzhe.zhong at rivai dot ai> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|UNCONFIRMED                 |RESOLVED
         Resolution|---                         |FIXED

--- Comment #2 from JuzheZhong <juzhe.zhong at rivai dot ai> ---
Fixed

^ permalink raw reply	[flat|nested] 4+ messages in thread

* [Bug target/112399] RISC-V: Missed AVL propagation for complicate reduction case
  2023-11-06  1:40 [Bug c/112399] New: RISC-V: Missed AVL propagation for complicate reduction case juzhe.zhong at rivai dot ai
  2023-11-07  7:02 ` [Bug target/112399] " cvs-commit at gcc dot gnu.org
  2023-11-07  7:03 ` juzhe.zhong at rivai dot ai
@ 2023-11-12 21:17 ` pinskia at gcc dot gnu.org
  2 siblings, 0 replies; 4+ messages in thread
From: pinskia at gcc dot gnu.org @ 2023-11-12 21:17 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112399

Andrew Pinski <pinskia at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
           Keywords|                            |missed-optimization
   Target Milestone|---                         |14.0

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2023-11-12 21:17 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2023-11-06  1:40 [Bug c/112399] New: RISC-V: Missed AVL propagation for complicate reduction case juzhe.zhong at rivai dot ai
2023-11-07  7:02 ` [Bug target/112399] " cvs-commit at gcc dot gnu.org
2023-11-07  7:03 ` juzhe.zhong at rivai dot ai
2023-11-12 21:17 ` pinskia at gcc dot gnu.org

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).