public inbox for gcc-bugs@sourceware.org
* [Bug other/57668] New: Regression in vectorizing memcpy pattern.
@ 2013-06-21 10:36 neleai at seznam dot cz
  2013-06-21 10:37 ` [Bug other/57668] " neleai at seznam dot cz
                   ` (2 more replies)
  0 siblings, 3 replies; 4+ messages in thread
From: neleai at seznam dot cz @ 2013-06-21 10:36 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=57668

            Bug ID: 57668
           Summary: Regression in vectorizing memcpy pattern.
           Product: gcc
           Version: 4.9.0
            Status: UNCONFIRMED
          Severity: normal
          Priority: P3
         Component: other
          Assignee: unassigned at gcc dot gnu.org
          Reporter: neleai at seznam dot cz

Hi,
When I ran the attached benchmark, which tests how well gcc optimizes a
byte-by-byte memcpy (attached memcpy_byte.c), I got a regression on the
nehalem and ivy_bridge architectures.
I ran it with the command ./run machine 2> machine_result
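
For readers without the attachment, the kind of byte-by-byte copy loop under
discussion might look roughly like the sketch below. The attached
memcpy_byte.c is not reproduced in this thread, so the function name
copy_bytes and its signature are hypothetical stand-ins, not the benchmark's
actual code.

```c
#include <stddef.h>

/* Hypothetical stand-in for the attached memcpy_byte.c, which is not
   shown in this thread. Copies n bytes one at a time; an optimizing
   compiler may vectorize this loop or replace it with a library call. */
void copy_bytes(char *dst, const char *src, size_t n)
{
    for (size_t i = 0; i < n; i++)
        dst[i] = src[i];   /* one byte load and one byte store per iteration */
}
```

Compiling such a function at -O3 with each compiler version and comparing the
generated assembly is one way to see the difference reported here.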

For ivy bridge, the results for 4.7 and 4.9 are:

memcpyO3-4.7.s
0.66
0.65
0.64
0.64
0.64
memcpyO3-4.9.s
0.74
0.74
0.73
0.74
0.74

Also, when I look at the assemblies, the 4.9 version is excessively large
compared to the 4.7 one.



* [Bug other/57668] Regression in vectorizing memcpy pattern.
  2013-06-21 10:36 [Bug other/57668] New: Regression in vectorizing memcpy pattern neleai at seznam dot cz
@ 2013-06-21 10:37 ` neleai at seznam dot cz
  2013-06-24 11:05 ` rguenth at gcc dot gnu.org
  2021-07-19 19:14 ` [Bug tree-optimization/57668] " pinskia at gcc dot gnu.org
  2 siblings, 0 replies; 4+ messages in thread
From: neleai at seznam dot cz @ 2013-06-21 10:37 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=57668

--- Comment #1 from Ondrej Bilka <neleai at seznam dot cz> ---
Created attachment 30333
  --> http://gcc.gnu.org/bugzilla/attachment.cgi?id=30333&action=edit
benchmark for memcpy



* [Bug other/57668] Regression in vectorizing memcpy pattern.
  2013-06-21 10:36 [Bug other/57668] New: Regression in vectorizing memcpy pattern neleai at seznam dot cz
  2013-06-21 10:37 ` [Bug other/57668] " neleai at seznam dot cz
@ 2013-06-24 11:05 ` rguenth at gcc dot gnu.org
  2021-07-19 19:14 ` [Bug tree-optimization/57668] " pinskia at gcc dot gnu.org
  2 siblings, 0 replies; 4+ messages in thread
From: rguenth at gcc dot gnu.org @ 2013-06-24 11:05 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=57668

Richard Biener <rguenth at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|UNCONFIRMED                 |NEW
   Last reconfirmed|                            |2013-06-24
                 CC|                            |hubicka at gcc dot gnu.org
     Ever confirmed|0                           |1

--- Comment #2 from Richard Biener <rguenth at gcc dot gnu.org> ---
Confirmed.

memcpy_byte.c:2: note: Completely unroll loop 14 times

The new unrolling code completely unrolls the epilogue loop:

Estimating sizes for loop 3
 BB: 8, after_exit: 0
  size:   0 i.0_40 = (sizetype) i_38;
  size:   1 _41 = s_7(D) + i.0_40;
  size:   1 _42 = d_9(D) + i.0_40;
  size:   1 _43 = *_42;
  size:   1 *_41 = _43;
  size:   1 i_45 = i_38 + 1;
  size:   2 if (i_45 < _16)
   Exit condition will be eliminated in last copy.
 BB: 9, after_exit: 1
size: 7-0, last_iteration: 7-2
  Loop size: 7
  Estimated size after unrolling: 68

And loop distribution fails to detect the memcpy clone, too.
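
The figures in the dump above are consistent with a simple estimate: 14
copies of the 7-instruction body, plus a last copy of 7 instructions with 2
eliminated, scaled by 2/3. The sketch below reproduces that arithmetic. The
2/3 discount is an assumption recalled from GCC's unroll size heuristic of
that era, not something stated in this thread, and the function name
sketch_unrolled_size is hypothetical.

```c
/* Sketch of the size estimate, under the assumption that the unrolled
   body is discounted by 2/3 for expected later simplification.
   Hypothetical helper, not GCC's actual code. */
int sketch_unrolled_size(int size, int eliminated,
                         int last_size, int last_eliminated,
                         int nunroll)
{
    /* Full copies of the loop body. */
    int insns = nunroll * (size - eliminated);
    /* The last copy, where the exit condition is eliminated. */
    insns += last_size - last_eliminated;
    /* Assumed 2/3 discount, using integer arithmetic. */
    insns = insns * 2 / 3;
    return insns > 0 ? insns : 1;
}
```

With the values from the dump (size 7-0, last_iteration 7-2, 14 unrolls)
this gives (14 * 7 + 5) * 2 / 3 = 68, matching "Estimated size after
unrolling: 68".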



* [Bug tree-optimization/57668] Regression in vectorizing memcpy pattern.
  2013-06-21 10:36 [Bug other/57668] New: Regression in vectorizing memcpy pattern neleai at seznam dot cz
  2013-06-21 10:37 ` [Bug other/57668] " neleai at seznam dot cz
  2013-06-24 11:05 ` rguenth at gcc dot gnu.org
@ 2021-07-19 19:14 ` pinskia at gcc dot gnu.org
  2 siblings, 0 replies; 4+ messages in thread
From: pinskia at gcc dot gnu.org @ 2021-07-19 19:14 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=57668

Andrew Pinski <pinskia at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
   Last reconfirmed|2013-06-24 00:00:00         |2021-7-19
         Resolution|---                         |FIXED
             Status|NEW                         |RESOLVED
   Target Milestone|---                         |8.3

--- Comment #3 from Andrew Pinski <pinskia at gcc dot gnu.org> ---
Fixed in 8.3 at least for skylake-512 (Intel(R) Xeon(R) D-2166NT CPU @
2.00GHz):
memcpyO3-831.s
0.68
memcpyO3-4.7.s
0.62
memcpyO3-4.9.s
0.78
memcpyO3-11.s
0.63
