public inbox for gcc-bugs@sourceware.org
* [Bug tree-optimization/96789] New: x264: sub4x4_dct() improves when vectorization is disabled
@ 2020-08-26  3:19 linkw at gcc dot gnu.org
  2020-08-26  6:53 ` [Bug target/96789] " rguenth at gcc dot gnu.org
                   ` (36 more replies)
  0 siblings, 37 replies; 38+ messages in thread
From: linkw at gcc dot gnu.org @ 2020-08-26  3:19 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=96789

            Bug ID: 96789
           Summary: x264: sub4x4_dct() improves when vectorization is
                    disabled
           Product: gcc
           Version: 11.0
            Status: UNCONFIRMED
          Severity: normal
          Priority: P3
         Component: tree-optimization
          Assignee: unassigned at gcc dot gnu.org
          Reporter: linkw at gcc dot gnu.org
  Target Milestone: ---

One of my colleagues found that disabling vectorization for the function
sub4x4_dct (source file x264_src/common/dct.c) in SPEC2017 525.x264_r, using
the explicit function attribute
__attribute__((optimize("no-tree-vectorize"))), yields a 4% speedup.

The options used are: -O3 -mcpu=power9 -fcommon -fno-strict-aliasing
-fgnu89-inline

I confirmed this finding, and it can be narrowed down further to SLP
vectorization by using the attribute
__attribute__((optimize("no-tree-slp-vectorize"))) instead.
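
For readers unfamiliar with the per-function attribute, here is a minimal sketch of how it is applied. The function diff4x4 below is a hypothetical stand-in written for illustration; it is not the x264 sub4x4_dct source, and its name and signature are assumptions. The attribute is GCC-specific; other compilers may ignore it with a warning.

```c
#include <stdint.h>

/* Hypothetical 4x4 pixel-difference kernel, for illustration only.
 * It is NOT the sub4x4_dct code from x264_src/common/dct.c.
 *
 * The optimize attribute asks GCC to compile just this function as if
 * -fno-tree-slp-vectorize had been given, leaving the rest of the
 * translation unit at the command-line optimization settings. */
__attribute__((optimize("no-tree-slp-vectorize")))
void diff4x4(int16_t d[16],
             const uint8_t *a, int stride_a,
             const uint8_t *b, int stride_b)
{
    for (int y = 0; y < 4; y++) {
        for (int x = 0; x < 4; x++)
            d[4 * y + x] = (int16_t)(a[x] - b[x]);
        a += stride_a;  /* advance to the next row of each block */
        b += stride_b;
    }
}
```

Swapping the string for "no-tree-vectorize" gives the broader variant mentioned in the first paragraph of the report. The attribute affects only code generation for the annotated function, so results should be identical either way; only the timing is expected to differ.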

I also checked this particular file with the r11-0 commit: the performance
stays unchanged with or without the vectorization attribute. So I think this
is a trunk regression, which probably exposes an SLP flaw or a cost modeling
issue.

Thread overview: 38+ messages
2020-08-26  3:19 [Bug tree-optimization/96789] New: x264: sub4x4_dct() improves when vectorization is disabled linkw at gcc dot gnu.org
2020-08-26  6:53 ` [Bug target/96789] " rguenth at gcc dot gnu.org
2020-08-26  7:13 ` linkw at gcc dot gnu.org
2020-08-27  3:28 ` linkw at gcc dot gnu.org
2020-08-27  6:40 ` rguenth at gcc dot gnu.org
2020-08-27 11:16 ` rguenth at gcc dot gnu.org
2020-08-31  4:05 ` linkw at gcc dot gnu.org
2020-09-16 10:03 ` linkw at gcc dot gnu.org
2020-09-16 11:17 ` rguenth at gcc dot gnu.org
2020-09-16 12:25 ` linkw at gcc dot gnu.org
2020-09-16 13:04 ` rguenth at gcc dot gnu.org
2020-09-17  2:50 ` linkw at gcc dot gnu.org
2020-09-17  5:06 ` linkw at gcc dot gnu.org
2020-09-18  9:11 ` linkw at gcc dot gnu.org
2020-09-18 10:30 ` rguenther at suse dot de
2020-09-18 13:40 ` linkw at gcc dot gnu.org
2020-09-19 15:45 ` crazylht at gmail dot com
2020-09-21  7:14 ` rguenther at suse dot de
2020-09-25 12:46 ` linkw at gcc dot gnu.org
2020-09-25 12:52 ` linkw at gcc dot gnu.org
2020-09-25 12:57 ` rguenth at gcc dot gnu.org
2020-09-25 13:05 ` rguenth at gcc dot gnu.org
2020-09-27  2:56 ` crazylht at gmail dot com
2020-09-27  3:07 ` crazylht at gmail dot com
2020-09-27  6:48 ` rguenther at suse dot de
2020-09-27 10:20 ` linkw at gcc dot gnu.org
2020-09-27 10:36 ` linkw at gcc dot gnu.org
2020-09-27 10:42 ` linkw at gcc dot gnu.org
2020-09-28  5:45 ` crazylht at gmail dot com
2020-09-28  6:40 ` linkw at gcc dot gnu.org
2020-09-28  6:48 ` rguenth at gcc dot gnu.org
2020-09-28  6:59 ` rguenth at gcc dot gnu.org
2020-09-28 12:54 ` linkw at gcc dot gnu.org
2020-09-28 13:27 ` rguenth at gcc dot gnu.org
2020-09-29  2:05 ` crazylht at gmail dot com
2020-09-29 12:27 ` rsandifo at gcc dot gnu.org
2020-11-03  6:28 ` cvs-commit at gcc dot gnu.org
2020-11-05  2:25 ` linkw at gcc dot gnu.org