Re:[pushed] [PATCH v2] LoongArch: Adjust cost of vector_stmt that match multiply-add pattern.

public inbox for gcc-patches@gcc.gnu.org
 help / color / mirror / Atom feed

From: chenglulu <chenglulu@loongson.cn>
To: Li Wei <liwei@loongson.cn>, gcc-patches@gcc.gnu.org
Cc: xry111@xry111.site, i@xen0n.name, xuchenghua@loongson.cn
Subject: Re:[pushed] [PATCH v2] LoongArch: Adjust cost of vector_stmt that match multiply-add pattern.
Date: Fri, 2 Feb 2024 09:23:46 +0800	[thread overview]
Message-ID: <6185ffcd-a553-28b4-2a08-d224729809b3@loongson.cn> (raw)
In-Reply-To: <20240126084111.1811519-1-liwei@loongson.cn>

Pushed to r14-8722.

在 2024/1/26 下午4:41, Li Wei 写道:
> We found that when only 128-bit vectorization was enabled, 549.fotonik3d_r
> failed to vectorize effectively. For this reason, we adjust the cost of
> 128-bit vector_stmt that match the multiply-add pattern to facilitate 128-bit
> vectorization.
> The experimental results show that after the modification, 549.fotonik3d_r
> performance can be improved by 9.77% under the 128-bit vectorization option.
>
> gcc/ChangeLog:
>
> 	* config/loongarch/loongarch.cc (loongarch_multiply_add_p): New.
> 	(loongarch_vector_costs::add_stmt_cost): Adjust.
>
> gcc/testsuite/ChangeLog:
>
> 	* gfortran.dg/vect/vect-10.f90: New test.
> ---
>   gcc/config/loongarch/loongarch.cc          | 48 +++++++++++++++
>   gcc/testsuite/gfortran.dg/vect/vect-10.f90 | 71 ++++++++++++++++++++++
>   2 files changed, 119 insertions(+)
>   create mode 100644 gcc/testsuite/gfortran.dg/vect/vect-10.f90
>
> diff --git a/gcc/config/loongarch/loongarch.cc b/gcc/config/loongarch/loongarch.cc
> index b494040d165..4d99e30828b 100644
> --- a/gcc/config/loongarch/loongarch.cc
> +++ b/gcc/config/loongarch/loongarch.cc
> @@ -4096,6 +4096,37 @@ loongarch_vector_costs::determine_suggested_unroll_factor (loop_vec_info loop_vi
>     return 1 << ceil_log2 (uf);
>   }
>   
> +/* Check if assign stmt rhs op comes from a multiply-add operation.  */
> +static bool
> +loongarch_multiply_add_p (vec_info *vinfo, stmt_vec_info stmt_info)
> +{
> +  gassign *assign = dyn_cast<gassign *> (stmt_info->stmt);
> +  if (!assign)
> +    return false;
> +  tree_code code = gimple_assign_rhs_code (assign);
> +  if (code != PLUS_EXPR && code != MINUS_EXPR)
> +    return false;
> +
> +  auto is_mul_result = [&](int i)
> +    {
> +      tree rhs = gimple_op (assign, i);
> +      if (TREE_CODE (rhs) != SSA_NAME)
> +	return false;
> +
> +      stmt_vec_info def_stmt_info = vinfo->lookup_def (rhs);
> +      if (!def_stmt_info
> +	  || STMT_VINFO_DEF_TYPE (def_stmt_info) != vect_internal_def)
> +	return false;
> +      gassign *rhs_assign = dyn_cast<gassign *> (def_stmt_info->stmt);
> +      if (!rhs_assign || gimple_assign_rhs_code (rhs_assign) != MULT_EXPR)
> +	return false;
> +
> +      return true;
> +    };
> +
> +  return is_mul_result (1) || is_mul_result (2);
> +}
> +
>   unsigned
>   loongarch_vector_costs::add_stmt_cost (int count, vect_cost_for_stmt kind,
>   				       stmt_vec_info stmt_info, slp_tree,
> @@ -4108,6 +4139,23 @@ loongarch_vector_costs::add_stmt_cost (int count, vect_cost_for_stmt kind,
>       {
>         int stmt_cost = loongarch_builtin_vectorization_cost (kind, vectype,
>   							    misalign);
> +      if (vectype && stmt_info)
> +	{
> +	  gassign *assign = dyn_cast<gassign *> (STMT_VINFO_STMT (stmt_info));
> +	  machine_mode mode = TYPE_MODE (vectype);
> +
> +	  /* We found through testing that this strategy (the stmt that
> +	     matches the multiply-add pattern) has positive returns only
> +	     when applied to the 128-bit vector stmt, so this restriction
> +	     is currently made.  */
> +	  if (kind == vector_stmt && GET_MODE_SIZE (mode) == 16 && assign)
> +	    {
> +	      if (!vect_is_reduction (stmt_info)
> +		  && loongarch_multiply_add_p (m_vinfo, stmt_info))
> +		stmt_cost = 0;
> +	    }
> +	}
> +
>         retval = adjust_cost_for_freq (stmt_info, where, count * stmt_cost);
>         m_costs[where] += retval;
>   
> diff --git a/gcc/testsuite/gfortran.dg/vect/vect-10.f90 b/gcc/testsuite/gfortran.dg/vect/vect-10.f90
> new file mode 100644
> index 00000000000..b85bc2702a3
> --- /dev/null
> +++ b/gcc/testsuite/gfortran.dg/vect/vect-10.f90
> @@ -0,0 +1,71 @@
> +! { dg-do compile }
> +! { dg-additional-options "-Ofast -mlsx -fvect-cost-model=dynamic" { target loongarch64*-*-* } }
> +
> +MODULE material_mod
> +
> +IMPLICIT NONE
> +
> +integer, parameter :: dfp = selected_real_kind (13, 99)
> +integer, parameter :: rfp = dfp
> +
> +PUBLIC Mat_updateE, iepx, iepy, iepz
> +
> +PRIVATE
> +
> +integer, dimension (:, :, :), allocatable :: iepx, iepy, iepz
> +real (kind = rfp), dimension (:), allocatable :: Dbdx, Dbdy, Dbdz
> +integer :: imin, jmin, kmin
> +integer, dimension (6) :: Exsize
> +integer, dimension (6) :: Eysize
> +integer, dimension (6) :: Ezsize
> +integer, dimension (6) :: Hxsize
> +integer, dimension (6) :: Hysize
> +integer, dimension (6) :: Hzsize
> +
> +CONTAINS
> +
> +SUBROUTINE mat_updateE (nx, ny, nz, Hx, Hy, Hz, Ex, Ey, Ez)
> +
> +integer, intent (in) :: nx, ny, nz
> +
> +real (kind = rfp), intent (inout),                                         &
> +  dimension (Exsize (1) : Exsize (2), Exsize (3) : Exsize (4), Exsize (5) : Exsize (6)) :: Ex
> +real (kind = rfp), intent (inout),                                         &
> +  dimension (Eysize (1) : Eysize (2), Eysize (3) : Eysize (4), Eysize (5) : Eysize (6)) :: Ey
> +real (kind = rfp), intent (inout),                                         &
> +  dimension (Ezsize (1) : Ezsize (2), Ezsize (3) : Ezsize (4), Ezsize (5) : Ezsize (6)) :: Ez
> +real (kind = rfp), intent (in),                                            &
> +  dimension (Hxsize (1) : Hxsize (2), Hxsize (3) : Hxsize (4), Hxsize (5) : Hxsize (6)) :: Hx
> +real (kind = rfp), intent (in),                                            &
> +  dimension (Hysize (1) : Hysize (2), Hysize (3) : Hysize (4), Hysize (5) : Hysize (6)) :: Hy
> +real (kind = rfp), intent (in),                                            &
> +  dimension (Hzsize (1) : Hzsize (2), Hzsize (3) : Hzsize (4), Hzsize (5) : Hzsize (6)) :: Hz
> +
> +integer :: i, j, k, mp
> +
> +do k = kmin, nz
> +  do j = jmin, ny
> +    do i = imin, nx
> +      mp = iepx (i, j, k)
> +      Ex (i, j, k) = Ex (i, j, k) +                                        &
> +                  Dbdy (mp) * (Hz (i, j, k  ) - Hz (i, j-1, k)) +          &
> +                  Dbdz (mp) * (Hy (i, j, k-1) - Hy (i, j  , k))
> +
> +      mp = iepy (i, j, k)
> +      Ey (i, j, k) = Ey (i, j, k) +                                        &
> +                  Dbdz (mp) * (Hx (i  , j, k) - Hx (i, j, k-1)) +          &
> +                  Dbdx (mp) * (Hz (i-1, j, k) - Hz (i, j, k  ))
> +
> +      mp = iepz (i, j, k)
> +      Ez (i, j, k) = Ez (i, j, k) +                                        &
> +                  Dbdx (mp) * (Hy (i, j  , k) - Hy (i-1, j, k)) +          &
> +                  Dbdy (mp) * (Hx (i, j-1, k) - Hx (i  , j, k))
> +    end do
> +  end do
> +end do
> +
> +END SUBROUTINE mat_updateE
> +
> +END MODULE material_mod
> +
> +! { dg-final { scan-tree-dump-times "vectorized 1 loops" 1 "vect" { target loongarch64*-*-* } } }

     prev parent reply	other threads:[~2024-02-02  1:23 UTC|newest]

Thread overview: 2+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2024-01-26  8:41 Li Wei
2024-02-02  1:23 ` chenglulu [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=6185ffcd-a553-28b4-2a08-d224729809b3@loongson.cn \
    --to=chenglulu@loongson.cn \
    --cc=gcc-patches@gcc.gnu.org \
    --cc=i@xen0n.name \
    --cc=liwei@loongson.cn \
    --cc=xry111@xry111.site \
    --cc=xuchenghua@loongson.cn \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).