public inbox for gcc-bugs@sourceware.org
help / color / mirror / Atom feed
* [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown
@ 2007-07-28 0:23 hjl at lucon dot org
2007-07-28 0:32 ` [Bug tree-optimization/32921] " pinskia at gcc dot gnu dot org
` (62 more replies)
0 siblings, 63 replies; 64+ messages in thread
From: hjl at lucon dot org @ 2007-07-28 0:23 UTC (permalink / raw)
To: gcc-bugs
This checkin:
http://gcc.gnu.org/ml/gcc-cvs/2007-07/msg00168.html
causes 12% slowdown for 437.leslie3d in SPEC CPU 2006 on 64bit Core 2
Duo when compiled with -O2 -ffast-math.
--
Summary: [4.3 Regression] Revision 126326 causes 12% slowdown
Product: gcc
Version: 4.3.0
Status: UNCONFIRMED
Severity: normal
Priority: P3
Component: tree-optimization
AssignedTo: unassigned at gcc dot gnu dot org
ReportedBy: hjl at lucon dot org
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
2007-07-28 0:32 ` [Bug tree-optimization/32921] " pinskia at gcc dot gnu dot org
@ 2007-07-28 0:32 ` pinskia at gcc dot gnu dot org
2007-07-28 0:51 ` hjl at lucon dot org
` (60 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: pinskia at gcc dot gnu dot org @ 2007-07-28 0:32 UTC (permalink / raw)
To: gcc-bugs
------- Comment #1 from pinskia at gcc dot gnu dot org 2007-07-28 00:32 -------
Do you know why? Because right now there is not enough information in this bug
to declare if it is a bug or not.
--
pinskia at gcc dot gnu dot org changed:
What |Removed |Added
----------------------------------------------------------------------------
Status|UNCONFIRMED |WAITING
Summary|[4.3 Regression] Revision |[4.3 Regression] Revision
|126326 causes 12% slowdown |126326 causes 12% slowdown
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
@ 2007-07-28 0:32 ` pinskia at gcc dot gnu dot org
2007-07-28 0:32 ` pinskia at gcc dot gnu dot org
` (61 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: pinskia at gcc dot gnu dot org @ 2007-07-28 0:32 UTC (permalink / raw)
To: gcc-bugs
------- Comment #2 from pinskia at gcc dot gnu dot org 2007-07-28 00:32 -------
Or really a scheduling fuck up in the core 2 duo.
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
2007-07-28 0:32 ` [Bug tree-optimization/32921] " pinskia at gcc dot gnu dot org
2007-07-28 0:32 ` pinskia at gcc dot gnu dot org
@ 2007-07-28 0:51 ` hjl at lucon dot org
2007-07-28 1:02 ` pinskia at gcc dot gnu dot org
` (59 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: hjl at lucon dot org @ 2007-07-28 0:51 UTC (permalink / raw)
To: gcc-bugs
------- Comment #3 from hjl at lucon dot org 2007-07-28 00:51 -------
I also saw 13% slowdown on Opteron. It may also happen on other none-x86
processors.
This program is in Fortran. It has loops with 3-D, 4-D and 5-D arrays.
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (2 preceding siblings ...)
2007-07-28 0:51 ` hjl at lucon dot org
@ 2007-07-28 1:02 ` pinskia at gcc dot gnu dot org
2007-07-28 10:02 ` rguenth at gcc dot gnu dot org
` (58 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: pinskia at gcc dot gnu dot org @ 2007-07-28 1:02 UTC (permalink / raw)
To: gcc-bugs
------- Comment #4 from pinskia at gcc dot gnu dot org 2007-07-28 01:02 -------
The patch just makes more types be the same inside GCC so if this caused a
regression, then this is a latent bug or it is the case where mem-ssa failed.
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (3 preceding siblings ...)
2007-07-28 1:02 ` pinskia at gcc dot gnu dot org
@ 2007-07-28 10:02 ` rguenth at gcc dot gnu dot org
2007-07-28 20:51 ` hjl at lucon dot org
` (57 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2007-07-28 10:02 UTC (permalink / raw)
To: gcc-bugs
------- Comment #5 from rguenth at gcc dot gnu dot org 2007-07-28 10:02 -------
It might be a missed optimization, so it would be nice to see where the
slowdown is and what changed there wrt trees we get. (note I will not have
time to
investigate this for the next three weeks as I'm on vacation)
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (4 preceding siblings ...)
2007-07-28 10:02 ` rguenth at gcc dot gnu dot org
@ 2007-07-28 20:51 ` hjl at lucon dot org
2007-07-28 20:59 ` rguenther at suse dot de
` (56 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: hjl at lucon dot org @ 2007-07-28 20:51 UTC (permalink / raw)
To: gcc-bugs
------- Comment #6 from hjl at lucon dot org 2007-07-28 20:51 -------
This part of the patch:
Index: tree-ssa.c
===================================================================
--- tree-ssa.c (revision 126326)
+++ tree-ssa.c (working copy)
@@ -979,11 +979,9 @@ useless_type_conversion_p (tree outer_ty
return false;
/* Do not lose casts between pointers with different
- TYPE_REF_CAN_ALIAS_ALL setting or alias sets. */
- if ((TYPE_REF_CAN_ALIAS_ALL (inner_type)
- != TYPE_REF_CAN_ALIAS_ALL (outer_type))
- || (get_alias_set (TREE_TYPE (inner_type))
- != get_alias_set (TREE_TYPE (outer_type))))
+ TYPE_REF_CAN_ALIAS_ALL setting. */
+ if (TYPE_REF_CAN_ALIAS_ALL (inner_type)
+ != TYPE_REF_CAN_ALIAS_ALL (outer_type))
return false;
/* Do not lose casts from const qualified to non-const
causes this performance regression.
437.leslie3d is a big file with 19 functions. If I put each function in
a separate file, there is no regression. If I put 2 particular functions
in one file, there are 10-30% regressions depending on input file. Those
2 functions work on several global arrays.
I tried " --param max-aliased-vops=1000000 --param avg-aliased-vops=7".
It doesn't make a difference.
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (5 preceding siblings ...)
2007-07-28 20:51 ` hjl at lucon dot org
@ 2007-07-28 20:59 ` rguenther at suse dot de
2007-07-28 23:25 ` hjl at lucon dot org
` (55 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: rguenther at suse dot de @ 2007-07-28 20:59 UTC (permalink / raw)
To: gcc-bugs
------- Comment #7 from rguenther at suse dot de 2007-07-28 20:59 -------
Subject: Re: [4.3 Regression] Revision 126326
causes 12% slowdown
On Sat, 28 Jul 2007, hjl at lucon dot org wrote:
> causes this performance regression.
>
> 437.leslie3d is a big file with 19 functions. If I put each function in
> a separate file, there is no regression. If I put 2 particular functions
> in one file, there are 10-30% regressions depending on input file. Those
> 2 functions work on several global arrays.
>
> I tried " --param max-aliased-vops=1000000 --param avg-aliased-vops=7".
> It doesn't make a difference.
This sounds strange and maybe relates this to PR32624.
Richard.
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (6 preceding siblings ...)
2007-07-28 20:59 ` rguenther at suse dot de
@ 2007-07-28 23:25 ` hjl at lucon dot org
2007-09-11 11:48 ` jsm28 at gcc dot gnu dot org
` (54 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: hjl at lucon dot org @ 2007-07-28 23:25 UTC (permalink / raw)
To: gcc-bugs
------- Comment #8 from hjl at lucon dot org 2007-07-28 23:25 -------
(In reply to comment #7)
> This sounds strange and maybe relates this to PR32624.
>
I tried the patch in
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32624#c4
It doesn't make a difference in this case. But both are related to
useless_type_conversion_p.
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (7 preceding siblings ...)
2007-07-28 23:25 ` hjl at lucon dot org
@ 2007-09-11 11:48 ` jsm28 at gcc dot gnu dot org
2007-10-07 15:17 ` hjl at lucon dot org
` (53 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: jsm28 at gcc dot gnu dot org @ 2007-09-11 11:48 UTC (permalink / raw)
To: gcc-bugs
--
jsm28 at gcc dot gnu dot org changed:
What |Removed |Added
----------------------------------------------------------------------------
Target Milestone|--- |4.3.0
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (9 preceding siblings ...)
2007-10-07 15:17 ` hjl at lucon dot org
@ 2007-10-07 15:17 ` hjl at lucon dot org
2007-10-07 15:19 ` hjl at lucon dot org
` (51 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: hjl at lucon dot org @ 2007-10-07 15:17 UTC (permalink / raw)
To: gcc-bugs
--
hjl at lucon dot org changed:
What |Removed |Added
----------------------------------------------------------------------------
CC| |xuepeng dot guo at intel dot
| |com
Status|WAITING |NEW
Ever Confirmed|0 |1
Last reconfirmed|2007-10-07 15:16:52 |2007-10-07 15:17:20
date| |
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (8 preceding siblings ...)
2007-09-11 11:48 ` jsm28 at gcc dot gnu dot org
@ 2007-10-07 15:17 ` hjl at lucon dot org
2007-10-07 15:17 ` hjl at lucon dot org
` (52 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: hjl at lucon dot org @ 2007-10-07 15:17 UTC (permalink / raw)
To: gcc-bugs
------- Comment #9 from hjl at lucon dot org 2007-10-07 15:16 -------
Gcc 4.3 revision 128885 is much slower than gcc 4.1 redhat revision 128771
with -O2 -ffast-math on 64bit Linux/Core 2 Duo:
437.leslie3d -26.2%
--
hjl at lucon dot org changed:
What |Removed |Added
----------------------------------------------------------------------------
CC| |Joey dot ye at intel dot com
Last reconfirmed|2007-08-27 20:23:04 |2007-10-07 15:16:52
date| |
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (10 preceding siblings ...)
2007-10-07 15:17 ` hjl at lucon dot org
@ 2007-10-07 15:19 ` hjl at lucon dot org
2007-10-10 17:48 ` mmitchel at gcc dot gnu dot org
` (50 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: hjl at lucon dot org @ 2007-10-07 15:19 UTC (permalink / raw)
To: gcc-bugs
------- Comment #10 from hjl at lucon dot org 2007-10-07 15:19 -------
(In reply to comment #9)
> Gcc 4.3 revision 128885 is much slower than gcc 4.1 redhat revision 128771
> with -O2 -ffast-math on 64bit Linux/Core 2 Duo:
>
> 437.leslie3d -26.2%
>
The main difference between gcc 4.1 redhat and gcc 4.1 is the -mtune=generic
change. I believe gcc 4.3 will also be much slower than gcc 4.1 on Opteron.
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (11 preceding siblings ...)
2007-10-07 15:19 ` hjl at lucon dot org
@ 2007-10-10 17:48 ` mmitchel at gcc dot gnu dot org
2007-10-17 16:14 ` pthaugen at gcc dot gnu dot org
` (49 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: mmitchel at gcc dot gnu dot org @ 2007-10-10 17:48 UTC (permalink / raw)
To: gcc-bugs
--
mmitchel at gcc dot gnu dot org changed:
What |Removed |Added
----------------------------------------------------------------------------
Priority|P3 |P2
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (12 preceding siblings ...)
2007-10-10 17:48 ` mmitchel at gcc dot gnu dot org
@ 2007-10-17 16:14 ` pthaugen at gcc dot gnu dot org
2007-10-17 16:19 ` pthaugen at gcc dot gnu dot org
` (48 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: pthaugen at gcc dot gnu dot org @ 2007-10-17 16:14 UTC (permalink / raw)
To: gcc-bugs
------- Comment #11 from pthaugen at gcc dot gnu dot org 2007-10-17 16:14 -------
Created an attachment (id=14364)
--> (http://gcc.gnu.org/bugzilla/attachment.cgi?id=14364&action=view)
Testcase
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (13 preceding siblings ...)
2007-10-17 16:14 ` pthaugen at gcc dot gnu dot org
@ 2007-10-17 16:19 ` pthaugen at gcc dot gnu dot org
2007-10-17 16:59 ` pinskia at gcc dot gnu dot org
` (47 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: pthaugen at gcc dot gnu dot org @ 2007-10-17 16:19 UTC (permalink / raw)
To: gcc-bugs
------- Comment #12 from pthaugen at gcc dot gnu dot org 2007-10-17 16:18 -------
And now some comments to go with the prior attatchment...
This checkin is causing a 75% degradation on leslie3d for PowerPC. As HJ
observed earlier, it depends on a second function accessing some of the global
data.
Generated code for simple copy loop in EXTRAPI(), compiled with -O2.
revision 126325:
.p2align 4,,15
.L3:
lfd 0,0(11) #* ivtmp.71, tmp198
add 11,11,6 # ivtmp.71, ivtmp.71, ivtmp.77
stfd 0,0(9) #* ivtmp.76, tmp198
add 9,9,5 # ivtmp.76, ivtmp.76, ivtmp.73
bdnz .L3 #
revision 126326:
.p2align 4,,15
.L6:
lwz 10,12(27) # <variable>.stride, prephitmp.66
lwz 3,12(28) # <variable>.stride, prephitmp.56
.L3:
mullw 10,10,12 # tmp141, prephitmp.66, i
lwz 9,36(30) # <variable>.stride, <variable>.stride
lwz 0,36(31) # <variable>.stride, <variable>.stride
lwz 8,4(30) # uav.offset, uav.offset
lwz 11,4(31) # qav.offset, qav.offset
lwz 6,24(30) # <variable>.stride, <variable>.stride
lwz 7,24(31) # <variable>.stride, <variable>.stride
lwz 4,0(30) # uav.data, uav.data
lwz 5,0(31) # qav.data, qav.data
mullw 3,3,12 # tmp160, prephitmp.56, i
slwi 9,9,1 # tmp142, <variable>.stride,
slwi 0,0,1 # tmp161, <variable>.stride,
add 9,9,8 # tmp143, tmp142, uav.offset
add 0,0,11 # tmp162, tmp161, qav.offset
add 9,9,6 # tmp145, tmp143, <variable>.stride
add 0,0,7 # tmp164, tmp162, <variable>.stride
add 9,9,10 # tmp147, tmp145, tmp141
add 0,0,3 # tmp166, tmp164, tmp160
slwi 9,9,3 # tmp148, tmp147,
slwi 0,0,3 # tmp167, tmp166,
addi 12,12,1 # i, i,
lfdx 0,5,0 #* qav.data, tmp169
cmpw 7,12,29 # imax.3, tmp170, i
stfdx 0,9,4 #* uav.data, tmp169
bne 7,.L6 #
--
pthaugen at gcc dot gnu dot org changed:
What |Removed |Added
----------------------------------------------------------------------------
CC| |pthaugen at gcc dot gnu dot
| |org, bergner at gcc dot gnu
| |dot org
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (14 preceding siblings ...)
2007-10-17 16:19 ` pthaugen at gcc dot gnu dot org
@ 2007-10-17 16:59 ` pinskia at gcc dot gnu dot org
2007-10-17 17:41 ` dberlin at dberlin dot org
` (46 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: pinskia at gcc dot gnu dot org @ 2007-10-17 16:59 UTC (permalink / raw)
To: gcc-bugs
------- Comment #13 from pinskia at gcc dot gnu dot org 2007-10-17 16:59 -------
Can someone explain this may_alias behavior:
so we have in the IR:
# VUSE <SFT.30_53>
D.892_19 = qav.data;
D.893_20 = (real8[0:] *) D.892_19;
in may_alias we get a constraint of:
D.892_19 = qav
D.893_20 = D.892_19
Isn't this wrong? D. 892_19 shouldn't just point to anywhere?
Inside the loop we get:
# VUSE <imax_33, SFT.11_34, SFT.12_35, SFT.13_36, SFT.14_37, SFT.15_38,
SFT.16_39, SFT.17_40, SFT.18_41, SFT.19_42, SFT.20_43, SFT.21_44, SFT.22_45,
SFT.23_46, SFT.24_47, SFT.25_48, SFT.26_49, SFT.27_50, SFT.28_51, SFT.29_52,
SFT.30_53, SMT.35_54>
D.903_30 = (*D.893_20)[D.902_29];
Which obviously shows that the vops are wrong.
--
pinskia at gcc dot gnu dot org changed:
What |Removed |Added
----------------------------------------------------------------------------
CC| |dberlin at gcc dot gnu dot
| |org
Keywords| |missed-optimization
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (15 preceding siblings ...)
2007-10-17 16:59 ` pinskia at gcc dot gnu dot org
@ 2007-10-17 17:41 ` dberlin at dberlin dot org
2007-10-17 21:22 ` rguenth at gcc dot gnu dot org
` (45 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: dberlin at dberlin dot org @ 2007-10-17 17:41 UTC (permalink / raw)
To: gcc-bugs
------- Comment #14 from dberlin at gcc dot gnu dot org 2007-10-17 17:41 -------
Subject: Re: [4.3 Regression] Revision 126326 causes 12% slowdown
On 17 Oct 2007 16:59:25 -0000, pinskia at gcc dot gnu dot org
<gcc-bugzilla@gcc.gnu.org> wrote:
>
>
> ------- Comment #13 from pinskia at gcc dot gnu dot org 2007-10-17 16:59 -------
> Can someone explain this may_alias behavior:
>
> so we have in the IR:
> # VUSE <SFT.30_53>
> D.892_19 = qav.data;
> D.893_20 = (real8[0:] *) D.892_19;
>
>
> in may_alias we get a constraint of:
> D.892_19 = qav
> D.893_20 = D.892_19
>
>
>
> Isn't this wrong? D. 892_19 shouldn't just point to anywhere?
qav should have a constraint somewhere that says qav = &ANYTHING
You won't directly see D.892_19 = ANYTHING
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (16 preceding siblings ...)
2007-10-17 17:41 ` dberlin at dberlin dot org
@ 2007-10-17 21:22 ` rguenth at gcc dot gnu dot org
2007-10-18 11:45 ` rguenth at gcc dot gnu dot org
` (44 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2007-10-17 21:22 UTC (permalink / raw)
To: gcc-bugs
------- Comment #15 from rguenth at gcc dot gnu dot org 2007-10-17 21:21 -------
comment #12 hints at that this is really the same problem as PR32624 (which
basically says aliasing is fucked up and non-deterministic).
--
rguenth at gcc dot gnu dot org changed:
What |Removed |Added
----------------------------------------------------------------------------
CC| |dnovillo at gcc dot gnu dot
| |org
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (17 preceding siblings ...)
2007-10-17 21:22 ` rguenth at gcc dot gnu dot org
@ 2007-10-18 11:45 ` rguenth at gcc dot gnu dot org
2007-10-18 20:55 ` hjl at lucon dot org
` (43 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2007-10-18 11:45 UTC (permalink / raw)
To: gcc-bugs
------- Comment #16 from rguenth at gcc dot gnu dot org 2007-10-18 11:45 -------
I have a patch.
--
rguenth at gcc dot gnu dot org changed:
What |Removed |Added
----------------------------------------------------------------------------
AssignedTo|unassigned at gcc dot gnu |rguenth at gcc dot gnu dot
|dot org |org
Status|NEW |ASSIGNED
Last reconfirmed|2007-10-07 15:17:20 |2007-10-18 11:45:39
date| |
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (18 preceding siblings ...)
2007-10-18 11:45 ` rguenth at gcc dot gnu dot org
@ 2007-10-18 20:55 ` hjl at lucon dot org
2007-10-19 11:26 ` rguenth at gcc dot gnu dot org
` (42 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: hjl at lucon dot org @ 2007-10-18 20:55 UTC (permalink / raw)
To: gcc-bugs
------- Comment #17 from hjl at lucon dot org 2007-10-18 20:55 -------
This patch:
http://gcc.gnu.org/ml/gcc-patches/2007-10/msg01047.html
makes 437.leslie3d 10% faster on Intel Core 2 Duo 64bit. But it is still 23%
slower than gcc 4.1 Red Hat.
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (19 preceding siblings ...)
2007-10-18 20:55 ` hjl at lucon dot org
@ 2007-10-19 11:26 ` rguenth at gcc dot gnu dot org
2007-10-19 12:28 ` rguenth at gcc dot gnu dot org
` (41 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2007-10-19 11:26 UTC (permalink / raw)
To: gcc-bugs
------- Comment #18 from rguenth at gcc dot gnu dot org 2007-10-19 11:26 -------
Subject: Bug 32921
Author: rguenth
Date: Fri Oct 19 11:25:55 2007
New Revision: 129484
URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=129484
Log:
2007-10-19 Richard Guenther <rguenther@suse.de>
PR middle-end/32921
* tree.c (build_array_type): Do not re-layout unbound array
types.
* gfortran.dg/pr32921.f: New testcase.
Added:
trunk/gcc/testsuite/gfortran.dg/pr32921.f
Modified:
trunk/gcc/ChangeLog
trunk/gcc/testsuite/ChangeLog
trunk/gcc/tree.c
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (20 preceding siblings ...)
2007-10-19 11:26 ` rguenth at gcc dot gnu dot org
@ 2007-10-19 12:28 ` rguenth at gcc dot gnu dot org
2007-10-19 12:40 ` rguenth at gcc dot gnu dot org
` (40 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2007-10-19 12:28 UTC (permalink / raw)
To: gcc-bugs
------- Comment #19 from rguenth at gcc dot gnu dot org 2007-10-19 12:27 -------
Subject: Bug 32921
Author: rguenth
Date: Fri Oct 19 12:27:25 2007
New Revision: 129487
URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=129487
Log:
2007-10-19 Richard Guenther <rguenther@suse.de>
Revert
2007-10-19 Richard Guenther <rguenther@suse.de>
PR middle-end/32921
* tree.c (build_array_type): Do not re-layout unbound array
types.
* gfortran.dg/pr32921.f: New testcase.
Removed:
trunk/gcc/testsuite/gfortran.dg/pr32921.f
Modified:
trunk/gcc/ChangeLog
trunk/gcc/testsuite/ChangeLog
trunk/gcc/tree.c
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (21 preceding siblings ...)
2007-10-19 12:28 ` rguenth at gcc dot gnu dot org
@ 2007-10-19 12:40 ` rguenth at gcc dot gnu dot org
2007-10-19 15:36 ` rguenth at gcc dot gnu dot org
` (39 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2007-10-19 12:40 UTC (permalink / raw)
To: gcc-bugs
------- Comment #20 from rguenth at gcc dot gnu dot org 2007-10-19 12:40 -------
Complete mess. Can of worms. Unassigning.
--
rguenth at gcc dot gnu dot org changed:
What |Removed |Added
----------------------------------------------------------------------------
AssignedTo|rguenth at gcc dot gnu dot |unassigned at gcc dot gnu
|org |dot org
Status|ASSIGNED |NEW
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (22 preceding siblings ...)
2007-10-19 12:40 ` rguenth at gcc dot gnu dot org
@ 2007-10-19 15:36 ` rguenth at gcc dot gnu dot org
2007-10-19 16:28 ` rguenth at gcc dot gnu dot org
` (38 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2007-10-19 15:36 UTC (permalink / raw)
To: gcc-bugs
------- Comment #21 from rguenth at gcc dot gnu dot org 2007-10-19 15:36 -------
Subject: Bug 32921
Author: rguenth
Date: Fri Oct 19 15:36:05 2007
New Revision: 129491
URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=129491
Log:
2007-10-19 Richard Guenther <rguenther@suse.de>
PR middle-end/33816
PR middle-end/32921
* stor-layout.c (layout_type): Assert that aggregates do not
have their alias sets set.
* alias.c (get_alias_set): Return alias set zero for incomplete
types, return the alias set of the element for incomplete array
types, but do not remember these.
* gfortran.dg/pr32921.f: New testcase.
Added:
trunk/gcc/testsuite/gfortran.dg/pr32921.f
Modified:
trunk/gcc/ChangeLog
trunk/gcc/alias.c
trunk/gcc/stor-layout.c
trunk/gcc/testsuite/ChangeLog
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (23 preceding siblings ...)
2007-10-19 15:36 ` rguenth at gcc dot gnu dot org
@ 2007-10-19 16:28 ` rguenth at gcc dot gnu dot org
2007-10-19 22:13 ` hjl at lucon dot org
` (37 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2007-10-19 16:28 UTC (permalink / raw)
To: gcc-bugs
------- Comment #22 from rguenth at gcc dot gnu dot org 2007-10-19 16:28 -------
Actually, the fix for PR33816 might have fixed this.
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (24 preceding siblings ...)
2007-10-19 16:28 ` rguenth at gcc dot gnu dot org
@ 2007-10-19 22:13 ` hjl at lucon dot org
2007-10-20 4:21 ` patchapp at dberlin dot org
` (36 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: hjl at lucon dot org @ 2007-10-19 22:13 UTC (permalink / raw)
To: gcc-bugs
------- Comment #23 from hjl at lucon dot org 2007-10-19 22:13 -------
Gcc 4.3 revision 129493 makes 437.leslie3d 25% faster than revision 129372 on
Intel Core 2 Duo 64bit. But it is still 13% slower than gcc 4.1 Red Hat.
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (25 preceding siblings ...)
2007-10-19 22:13 ` hjl at lucon dot org
@ 2007-10-20 4:21 ` patchapp at dberlin dot org
2007-10-20 9:39 ` rguenth at gcc dot gnu dot org
` (35 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: patchapp at dberlin dot org @ 2007-10-20 4:21 UTC (permalink / raw)
To: gcc-bugs
------- Comment #24 from patchapp at dberlin dot org 2007-10-20 04:21 -------
Subject: Bug number PR32921
A patch for this bug has been added to the patch tracker.
The mailing list url for the patch is
http://gcc.gnu.org/ml/gcc-patches/2007-10/msg01036.html
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (26 preceding siblings ...)
2007-10-20 4:21 ` patchapp at dberlin dot org
@ 2007-10-20 9:39 ` rguenth at gcc dot gnu dot org
2007-10-20 16:05 ` hjl at lucon dot org
` (34 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2007-10-20 9:39 UTC (permalink / raw)
To: gcc-bugs
------- Comment #25 from rguenth at gcc dot gnu dot org 2007-10-20 09:38 -------
HJ, does applying the patch from comment #6 bring back performance to 4.1 RH
level?
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (27 preceding siblings ...)
2007-10-20 9:39 ` rguenth at gcc dot gnu dot org
@ 2007-10-20 16:05 ` hjl at lucon dot org
2007-10-22 21:11 ` pthaugen at gcc dot gnu dot org
` (33 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: hjl at lucon dot org @ 2007-10-20 16:05 UTC (permalink / raw)
To: gcc-bugs
------- Comment #26 from hjl at lucon dot org 2007-10-20 16:05 -------
(In reply to comment #25)
> HJ, does applying the patch from comment #6 bring back performance to 4.1 RH
> level?
>
It makes no difference. I saw 20% slowdown in 437.leslie3d on Intel Core
2 Duo 64bit between revision 117890 and revision 119497. It could be
PR 30735.
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (28 preceding siblings ...)
2007-10-20 16:05 ` hjl at lucon dot org
@ 2007-10-22 21:11 ` pthaugen at gcc dot gnu dot org
2007-10-23 2:23 ` Joey dot ye at intel dot com
` (32 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: pthaugen at gcc dot gnu dot org @ 2007-10-22 21:11 UTC (permalink / raw)
To: gcc-bugs
------- Comment #27 from pthaugen at gcc dot gnu dot org 2007-10-22 21:11 -------
I tried a recent mainline on PowerPC for leslie3d, revision 129550 improved
over revision 129454 by 67%.
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (29 preceding siblings ...)
2007-10-22 21:11 ` pthaugen at gcc dot gnu dot org
@ 2007-10-23 2:23 ` Joey dot ye at intel dot com
2007-10-23 20:29 ` pthaugen at gcc dot gnu dot org
` (31 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: Joey dot ye at intel dot com @ 2007-10-23 2:23 UTC (permalink / raw)
To: gcc-bugs
------- Comment #28 from Joey dot ye at intel dot com 2007-10-23 02:23 -------
Got similar result on x86_64, Core 2 improves 24% from 129469 to 129504. That's
great.
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (30 preceding siblings ...)
2007-10-23 2:23 ` Joey dot ye at intel dot com
@ 2007-10-23 20:29 ` pthaugen at gcc dot gnu dot org
2007-10-23 20:31 ` pthaugen at gcc dot gnu dot org
` (30 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: pthaugen at gcc dot gnu dot org @ 2007-10-23 20:29 UTC (permalink / raw)
To: gcc-bugs
------- Comment #29 from pthaugen at gcc dot gnu dot org 2007-10-23 20:29 -------
Found another example on PowerPC in the same benchmark that is not fixed by
the checked in patches. Compiled with -m32 -O2. From the loop in procedure
FLUXI:
revision 126325:
.L47:
lfd 0,0(9) #* ivtmp.277, tmp346
lfd 13,-8(9) #, tmp347
add 9,9,0 # ivtmp.277, ivtmp.277, ivtmp.309
fsub 0,0,13 # tmp345, tmp346, tmp347
fmul 0,0,12 # tmp348, tmp345, dtvol.23
fneg 0,0 # tmp349, tmp348
stfd 0,0(11) #* ivtmp.306, tmp349
add 11,11,10 # ivtmp.306, ivtmp.306, ivtmp.280
bdnz .L47 #
revision 126326 (and current mainline):
.L83:
lwz 0,48(7) # <variable>.stride, <variable>.stride
lfd 0,0(10) #* ivtmp.277, tmp416
lfd 13,-8(10) #, tmp417
lfd 12,0(5) # dtvol, dtvol
add 10,10,6 # ivtmp.277, ivtmp.277, ivtmp.306
mullw 0,8,0 # tmp409, l, <variable>.stride
fsub 0,0,13 # tmp415, tmp416, tmp417
lwz 9,4(7) # du.offset, du.offset
lwz 11,0(7) # du.data, du.data
fmul 0,0,12 # tmp420, tmp415, dtvol
fneg 0,0 # tmp422, tmp420
add 0,0,9 # tmp412, tmp409, du.offset
addi 8,8,1 # l, l,
slwi 0,0,3 # tmp413, tmp412,
stfdx 0,11,0 #, tmp422
bdnz .L83 #
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (31 preceding siblings ...)
2007-10-23 20:29 ` pthaugen at gcc dot gnu dot org
@ 2007-10-23 20:31 ` pthaugen at gcc dot gnu dot org
2007-10-24 12:23 ` rguenth at gcc dot gnu dot org
` (29 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: pthaugen at gcc dot gnu dot org @ 2007-10-23 20:31 UTC (permalink / raw)
To: gcc-bugs
------- Comment #30 from pthaugen at gcc dot gnu dot org 2007-10-23 20:30 -------
Created an attachment (id=14402)
--> (http://gcc.gnu.org/bugzilla/attachment.cgi?id=14402&action=view)
Testcase
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (32 preceding siblings ...)
2007-10-23 20:31 ` pthaugen at gcc dot gnu dot org
@ 2007-10-24 12:23 ` rguenth at gcc dot gnu dot org
2007-10-24 15:02 ` rguenth at gcc dot gnu dot org
` (28 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2007-10-24 12:23 UTC (permalink / raw)
To: gcc-bugs
------- Comment #31 from rguenth at gcc dot gnu dot org 2007-10-24 12:23 -------
This is a partitioning issue. We allocate the same MPT to loads and stores
in a loop. Fixed with for example --param max-aliased-vops=10000.
Diego, this is yours. Avoiding false aliasing of loads and stores should
be top priority in the partitioning heuristics.
Though even with no paritioning we are not able to move the invariant load
of dtvol:
.L68:
movsd (%rsi), %xmm0
addl $1, %ecx
addq %rdi, %rsi
subsd (%rdx), %xmm0
addq %rdi, %rdx
mulsd __les3d_data_MOD_dtvol(%rip), %xmm0
xorpd %xmm1, %xmm0
movsd %xmm0, (%rax)
addq %r8, %rax
cmpl $6, %ecx
jne .L68
because:
# VUSE <dtvol_613>
dtvol.23_424 = dtvol;
D.1276_425 = D.1274_423 * dtvol.23_424;
D.1277_426 = -D.1276_425;
# dtvol_1865 = VDEF <dtvol_613>
# HEAP.185_1160 = VDEF <HEAP.185_609>
# HEAP.186_1142 = VDEF <HEAP.186_1194>
# HEAP.202_1108 = VDEF <HEAP.202_1177>
(*pretmp.248_955)[D.1259_397] = D.1277_426;
as we think that pretmp.248_955 might point to dtvol. The pointers just
have a constraint like
pretmp.246_921 = du
and
du = &ANYTHING
du.ubound = &ANYTHING
du.lbound = &ANYTHING
du.stride = &ANYTHING
du.ubound = &ANYTHING
du.lbound = &ANYTHING
du.stride = &ANYTHING
du.ubound = &ANYTHING
du.lbound = &ANYTHING
du.stride = &ANYTHING
du.ubound = &ANYTHING
du.lbound = &ANYTHING
du.stride = &ANYTHING
du.dtype = &ANYTHING
du.offset = &ANYTHING
and dtvol and DU are global variables. So we are possibly out of luck
for this one.
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (33 preceding siblings ...)
2007-10-24 12:23 ` rguenth at gcc dot gnu dot org
@ 2007-10-24 15:02 ` rguenth at gcc dot gnu dot org
2007-10-24 15:18 ` rguenth at gcc dot gnu dot org
` (27 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2007-10-24 15:02 UTC (permalink / raw)
To: gcc-bugs
------- Comment #32 from rguenth at gcc dot gnu dot org 2007-10-24 15:02 -------
I have a patch to "fix" the heuristics, but it doesn't have effect as nobody
hoists PHI nodes that have become invariant apperantly. After PRE I see
<bb 100>:
# HEAP.202_828 = PHI <HEAP.202_747(150), HEAP.202_743(148)>
<bb 101>:
# SMT.222_639 = PHI <SMT.222_638(100), SMT.222_599(152)>
# dtvol_656 = PHI <dtvol_620(100), dtvol_655(152)>
# MPT.242_668 = PHI <MPT.242_744(100), MPT.242_668(152)>
# l_1 = PHI <1(100), l_428(152)>
# VUSE <MPT.242_668>
D.1241_376 = du.data;
... (more VUSEs of MPT.242_668, the only VDEFs in the loop follow)
# dtvol_655 = VDEF <dtvol_656>
# SMT.222_599 = VDEF <SMT.222_639>
(*D.1242_377)[D.1259_397] = D.1277_426;
l_428 = l_1 + 1;
if (l_1 == 5)
goto <bb 102>;
else
goto <bb 152>;
<bb 152>:
goto <bb 101>;
<bb 102>:
the
# SMT.222_639 = PHI <SMT.222_638(100), SMT.222_599(152)>
could be hoisted to BB100 (even to BB92). Who is supposed to do this?
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (34 preceding siblings ...)
2007-10-24 15:02 ` rguenth at gcc dot gnu dot org
@ 2007-10-24 15:18 ` rguenth at gcc dot gnu dot org
2007-10-24 15:19 ` rguenth at gcc dot gnu dot org
` (26 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2007-10-24 15:18 UTC (permalink / raw)
To: gcc-bugs
------- Comment #33 from rguenth at gcc dot gnu dot org 2007-10-24 15:18 -------
DOM does, so scheduling another dominator pass after PRE (where alias is
re-run)
fixes the problem. One problem with partitioning is that we don't collect
statistics for symbols inside a partition, which makes all but the first
paritioning run prefer already partitioned symbols.
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (35 preceding siblings ...)
2007-10-24 15:18 ` rguenth at gcc dot gnu dot org
@ 2007-10-24 15:19 ` rguenth at gcc dot gnu dot org
2007-10-24 23:15 ` pthaugen at gcc dot gnu dot org
` (25 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2007-10-24 15:19 UTC (permalink / raw)
To: gcc-bugs
------- Comment #34 from rguenth at gcc dot gnu dot org 2007-10-24 15:19 -------
Created an attachment (id=14406)
--> (http://gcc.gnu.org/bugzilla/attachment.cgi?id=14406&action=view)
partition heuristics change
The idea is to change the statistics to make the values in the fields
match their names and based on that avoid partitioning for variables
that are often directly accessed or indirectly accessed as mostly
reads or writes. It probably makes sense to change the field names
and weight the individual accesses with their BB frequency again.
Of course the patch is not finished and is untested apart from throwing
it on the 2nd testcase.
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (36 preceding siblings ...)
2007-10-24 15:19 ` rguenth at gcc dot gnu dot org
@ 2007-10-24 23:15 ` pthaugen at gcc dot gnu dot org
2007-10-25 8:55 ` rguenth at gcc dot gnu dot org
` (24 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: pthaugen at gcc dot gnu dot org @ 2007-10-24 23:15 UTC (permalink / raw)
To: gcc-bugs
------- Comment #35 from pthaugen at gcc dot gnu dot org 2007-10-24 23:15 -------
I reran leslie3d on PowerPC with --param max-aliased-vops=10000. The result was
a 90% improvement over my prior revision 129550 run (218% improvement over the
original mainline run).
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (37 preceding siblings ...)
2007-10-24 23:15 ` pthaugen at gcc dot gnu dot org
@ 2007-10-25 8:55 ` rguenth at gcc dot gnu dot org
2007-10-25 10:06 ` pinskia at gcc dot gnu dot org
` (23 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2007-10-25 8:55 UTC (permalink / raw)
To: gcc-bugs
------- Comment #36 from rguenth at gcc dot gnu dot org 2007-10-25 08:55 -------
One reason why we see a regression here regarding to partitioning is that the
fortran FE now inlines allocate () producing three calls instead of one, which
spoils the partitioning heuristics:
{
void * D.1028;
logical4 D.1027;
int8 size.0;
int8 D.1025;
qs.dtype = 537;
qs.dim[0].lbound = 0;
qs.dim[0].ubound = (int8) (imax + -1);
qs.dim[0].stride = 1;
D.1025 = (int8) (imax + -1) + 1;
D.1027 = imax <= 0;
if (D.1027)
{
size.0 = 0;
}
else
{
size.0 = D.1025 * 8;
}
if (qs.data == 0B)
{
{
void * D.1030;
int8 D.1029;
D.1029 = size.0;
if (D.1029 < 0)
{
_gfortran_runtime_error (&"Attempt to allocate negative amount of
memory. Possible integer overflow"[1]{lb: 1 sz: 1});
}
else
{
D.1030 = __builtin_malloc (MAX_EXPR <D.1029, 1>);
if (D.1030 == 0B)
{
_gfortran_os_error (&"Out of memory"[1]{lb: 1 sz: 1});
}
}
D.1028 = D.1030;
}
}
else
{
_gfortran_runtime_error (&"Attempting to allocate already allocated
array"[1]{lb: 1 sz: 1});
}
qs.data = D.1028;
qs.offset = 0;
}
if we mark the error functions as having no VOPs (I don't see that we need
to preserve or order memory operations for these particular functions that
do not return) then this should clean up the number of VOPs considerably.
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (38 preceding siblings ...)
2007-10-25 8:55 ` rguenth at gcc dot gnu dot org
@ 2007-10-25 10:06 ` pinskia at gcc dot gnu dot org
2007-10-25 11:02 ` rguenth at gcc dot gnu dot org
` (22 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: pinskia at gcc dot gnu dot org @ 2007-10-25 10:06 UTC (permalink / raw)
To: gcc-bugs
------- Comment #37 from pinskia at gcc dot gnu dot org 2007-10-25 10:05 -------
_gfortran_runtime_error is marked as no return which means virtual operations
should not be on it. Sounds like noreturn should be the same as no vops
(maybe). You have to take care about exceptions as no return functions can
still change global state and "return" via an exception (or a long jump).
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (39 preceding siblings ...)
2007-10-25 10:06 ` pinskia at gcc dot gnu dot org
@ 2007-10-25 11:02 ` rguenth at gcc dot gnu dot org
2007-10-25 11:07 ` rguenth at gcc dot gnu dot org
` (21 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2007-10-25 11:02 UTC (permalink / raw)
To: gcc-bugs
------- Comment #38 from rguenth at gcc dot gnu dot org 2007-10-25 11:02 -------
Another thing is that for all of the mem_sym_stats we collect, we _only_
consider
memory references through pointers(!), but not for example
# VUSE <MPT.242_651>
D.1244_380 = du.dim[0].stride;
the associated SFT will neither get indirect nor direct reads accounted.
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (40 preceding siblings ...)
2007-10-25 11:02 ` rguenth at gcc dot gnu dot org
@ 2007-10-25 11:07 ` rguenth at gcc dot gnu dot org
2008-03-14 16:53 ` [Bug tree-optimization/32921] [4.3/4.4 " rguenth at gcc dot gnu dot org
` (20 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2007-10-25 11:07 UTC (permalink / raw)
To: gcc-bugs
------- Comment #39 from rguenth at gcc dot gnu dot org 2007-10-25 11:06 -------
No, in general noreturn functions cannot be treated as novops. Consider
void __attribute__((noreturn,noinline)) my_main(int *ret)
{
exit(*ret);
}
int main()
{
int ret = 0;
my_main (&ret);
}
without VOPs we would remove the store to ret.
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3/4.4 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (41 preceding siblings ...)
2007-10-25 11:07 ` rguenth at gcc dot gnu dot org
@ 2008-03-14 16:53 ` rguenth at gcc dot gnu dot org
2008-04-23 14:10 ` rguenth at gcc dot gnu dot org
` (19 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2008-03-14 16:53 UTC (permalink / raw)
To: gcc-bugs
------- Comment #40 from rguenth at gcc dot gnu dot org 2008-03-14 16:53 -------
Adjusting target milestone (though realistically this will not be fixed for
4.3.x).
--
rguenth at gcc dot gnu dot org changed:
What |Removed |Added
----------------------------------------------------------------------------
Known to fail| |4.3.0
Target Milestone|4.3.0 |4.3.1
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3/4.4 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (42 preceding siblings ...)
2008-03-14 16:53 ` [Bug tree-optimization/32921] [4.3/4.4 " rguenth at gcc dot gnu dot org
@ 2008-04-23 14:10 ` rguenth at gcc dot gnu dot org
2008-04-23 14:10 ` rguenth at gcc dot gnu dot org
` (18 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2008-04-23 14:10 UTC (permalink / raw)
To: gcc-bugs
------- Comment #42 from rguenth at gcc dot gnu dot org 2008-04-23 14:09 -------
Likely fixed on the mainline.
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3/4.4 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (43 preceding siblings ...)
2008-04-23 14:10 ` rguenth at gcc dot gnu dot org
@ 2008-04-23 14:10 ` rguenth at gcc dot gnu dot org
2008-04-30 18:50 ` pthaugen at gcc dot gnu dot org
` (17 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2008-04-23 14:10 UTC (permalink / raw)
To: gcc-bugs
------- Comment #41 from rguenth at gcc dot gnu dot org 2008-04-23 14:09 -------
Subject: Bug 32921
Author: rguenth
Date: Wed Apr 23 14:08:25 2008
New Revision: 134598
URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=134598
Log:
2008-04-23 Richard Guenther <rguenther@suse.de>
PR tree-optimization/27799
PR tree-optimization/32921
PR tree-optimization/32624
* tree-ssa-structalias.c (merge_smts_into): Only merge the
SMTs aliases and the tag itself into the solution.
* tree-ssa-alias.c (compute_flow_sensitive_aliasing): Do not
merge the points-to solution back into the SMT aliases.
(may_alias_p): Use alias_set_subset_of instead of
aliases_conflict_p. A pointer which points to
memory with alias set zero may access any variable.
* gcc.dg/tree-ssa/pr27799.c: New testcase.
* gcc.dg/tree-ssa/20030807-7.c: Remove xfail, scan vrp dump.
Added:
trunk/gcc/testsuite/gcc.dg/tree-ssa/pr27799.c
Modified:
trunk/gcc/ChangeLog
trunk/gcc/testsuite/ChangeLog
trunk/gcc/testsuite/gcc.dg/tree-ssa/20030807-7.c
trunk/gcc/tree-ssa-alias.c
trunk/gcc/tree-ssa-structalias.c
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3/4.4 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (44 preceding siblings ...)
2008-04-23 14:10 ` rguenth at gcc dot gnu dot org
@ 2008-04-30 18:50 ` pthaugen at gcc dot gnu dot org
2008-04-30 20:30 ` rguenth at gcc dot gnu dot org
` (16 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: pthaugen at gcc dot gnu dot org @ 2008-04-30 18:50 UTC (permalink / raw)
To: gcc-bugs
------- Comment #43 from pthaugen at gcc dot gnu dot org 2008-04-30 18:49 -------
Created an attachment (id=15553)
--> (http://gcc.gnu.org/bugzilla/attachment.cgi?id=15553&action=view)
Testcase
I tried a mainline with the latest patch. While we no longer have problems
with the prior testcases, there is no improvement for leslie3d on ppc64. I can
still double the performance of the benchmark by specifying --param
max-aliased-vops=10000.
Including a new trimmed down testcase from the benchmark where I'm still seeing
poor code when max-aliased-vops is not increased, compiled with 'gfortran -m32
-O2'.
Refer to the first nested loop in procedure FLUXK():
DO I = I1, I2
QS(I) = WAV(I,J,K) * ZAREA
END DO
Base:
.L150:
lwz 0,24(18) # <variable>.stride, <variable>.stride
lwz 9,36(18) # <variable>.stride, <variable>.stride
lwz 11,12(18) # <variable>.stride, <variable>.stride
lwz 10,4(18) # wav.offset, wav.offset
mullw 0,17,0 # tmp660, ivtmp.602, <variable>.stride
lwz 8,0(18) # wav.data, wav.data
mullw 9,30,9 # tmp666, ivtmp.590, <variable>.stride
mullw 11,6,11 # tmp670, i, <variable>.stride
add 0,0,9 # tmp672, tmp660, tmp666
addi 6,6,1 # i, i,
add 0,0,11 # tmp673, tmp672, tmp670
add 0,0,10 # tmp674, tmp673, wav.offset
slwi 0,0,3 # tmp676, tmp674,
lfdx 0,8,0 #, tmp678
fmul 0,0,8 # tmp679, tmp678, zarea.64
stfdx 0,15,7 #* ivtmp.589, tmp679
addi 7,7,8 # ivtmp.589, ivtmp.589,
bdnz .L150 #
With --param max-aliased-vops=1000:
.L150:
lfd 0,0(11) #* ivtmp.599, tmp739
add 11,11,30 # ivtmp.599, ivtmp.599, D.2783
fmul 0,0,8 # tmp740, tmp739, zarea.64
stfdx 0,22,9 #* ivtmp.602, tmp740
addi 9,9,8 # ivtmp.602, ivtmp.602,
bdnz .L150 #
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3/4.4 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (45 preceding siblings ...)
2008-04-30 18:50 ` pthaugen at gcc dot gnu dot org
@ 2008-04-30 20:30 ` rguenth at gcc dot gnu dot org
2008-04-30 21:43 ` rguenth at gcc dot gnu dot org
` (15 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2008-04-30 20:30 UTC (permalink / raw)
To: gcc-bugs
------- Comment #44 from rguenth at gcc dot gnu dot org 2008-04-30 20:29 -------
We are putting the HEAP tag for D.914_385 and wav in the same partition:
<bb 198>:
# MPT.501_1171 = PHI <MPT.501_1167(197), MPT.501_1228(199)>
# i_25 = PHI <1(197), i_646(199)>
D.1264_629 = i_25 + -1;
# VUSE <MPT.501_1171>
D.1265_630 = wav.data;
D.1266_631 = (real(kind=8)[0:] *) D.1265_630;
# VUSE <MPT.501_1171>
D.1267_632 = wav.dim[0].stride;
D.1268_633 = i_25 * D.1267_632;
# VUSE <MPT.501_1171>
D.1269_634 = wav.dim[1].stride;
D.1270_635 = j_28 * D.1269_634;
# VUSE <MPT.501_1171>
D.1272_637 = wav.dim[2].stride;
D.1273_638 = k_29 * D.1272_637;
# VUSE <MPT.501_1171>
D.1275_640 = wav.offset;
D.1271_636 = D.1270_635 + D.1275_640;
D.1274_639 = D.1271_636 + D.1273_638;
D.1276_641 = D.1274_639 + D.1268_633;
# VUSE <dxi_301, zarea_284, SMT.480_250>
D.1277_642 = (*D.1266_631)[D.1276_641];
D.1279_644 = D.1277_642 * zarea.64_643;
# MPT.501_1228 = VDEF <MPT.501_1171>
(*D.914_385)[D.1264_629] = D.1279_644;
i_646 = i_25 + 1;
if (i_25 == i2_608)
A shorter testcase doesn't reproduce this bogus decision even with
--param max-aliased-vops=0. LIM should be able to disambiguate here
with TBAA. And indeed, the following seems to fix it
Index: tree-ssa-loop-im.c
===================================================================
--- tree-ssa-loop-im.c (revision 134835)
+++ tree-ssa-loop-im.c (working copy)
@@ -1640,6 +1640,8 @@ mem_refs_may_alias_p (tree mem1, tree me
&& SSA_VAR_P (mem1)
&& !AGGREGATE_TYPE_P (TREE_TYPE (mem1)))
return false;
+ if (!alias_sets_conflict_p (get_alias_set (mem1), get_alias_set (mem2)))
+ return false;
}
/* The expansion of addresses may be a bit expensive, thus we only do
Pat - can you confirm this?
--
rguenth at gcc dot gnu dot org changed:
What |Removed |Added
----------------------------------------------------------------------------
AssignedTo|unassigned at gcc dot gnu |rguenth at gcc dot gnu dot
|dot org |org
Status|NEW |ASSIGNED
Last reconfirmed|2007-10-18 11:45:39 |2008-04-30 20:29:48
date| |
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3/4.4 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (46 preceding siblings ...)
2008-04-30 20:30 ` rguenth at gcc dot gnu dot org
@ 2008-04-30 21:43 ` rguenth at gcc dot gnu dot org
2008-04-30 22:36 ` pthaugen at gcc dot gnu dot org
` (14 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2008-04-30 21:43 UTC (permalink / raw)
To: gcc-bugs
------- Comment #45 from rguenth at gcc dot gnu dot org 2008-04-30 21:43 -------
Subject: Bug 32921
Author: rguenth
Date: Wed Apr 30 21:42:24 2008
New Revision: 134838
URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=134838
Log:
2008-04-30 Richard Guenther <rguenther@suse.de>
PR tree-optimization/32921
* tree-ssa-loop-im.c (mem_refs_may_alias_p): Disambiguate with TBAA.
Modified:
trunk/gcc/ChangeLog
trunk/gcc/tree-ssa-loop-im.c
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3/4.4 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (47 preceding siblings ...)
2008-04-30 21:43 ` rguenth at gcc dot gnu dot org
@ 2008-04-30 22:36 ` pthaugen at gcc dot gnu dot org
2008-05-01 19:27 ` pthaugen at gcc dot gnu dot org
` (13 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: pthaugen at gcc dot gnu dot org @ 2008-04-30 22:36 UTC (permalink / raw)
To: gcc-bugs
------- Comment #46 from pthaugen at gcc dot gnu dot org 2008-04-30 22:35 -------
Just following up with comments posted on irc. The patch does fix the problem
I was seeing. Spec ratio improved from 5.18 to 9.07 with the patch (75%), not
quite the full 100% improvement I was seeing with --param
max-aliased-vops=10000, but getting awfully close.
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3/4.4 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (48 preceding siblings ...)
2008-04-30 22:36 ` pthaugen at gcc dot gnu dot org
@ 2008-05-01 19:27 ` pthaugen at gcc dot gnu dot org
2008-05-02 10:29 ` rguenth at gcc dot gnu dot org
` (12 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: pthaugen at gcc dot gnu dot org @ 2008-05-01 19:27 UTC (permalink / raw)
To: gcc-bugs
------- Comment #47 from pthaugen at gcc dot gnu dot org 2008-05-01 19:26 -------
Created an attachment (id=15557)
--> (http://gcc.gnu.org/bugzilla/attachment.cgi?id=15557&action=view)
Testcase
Found some more cases, which are contributing to the missing 25%. Looks like
similar code, but obviously something different about it since it's not getting
caught with the latest patch. Compiled with -O2 -m64.
>From first nested loop in FLUXI():
DO I = I1,I2
QS(I) = UAV(I,J,K) * XAREA
END DO
Base:
.L185:
ld 10,168(1) # pretmp.1065,
ld 4,424(1) # D.1331,
sldi 11,9,3 # tmp667, i,
mulld 0,10,9 # tmp669,, i
add 0,0,25 # tmp670, tmp669, ivtmp.1093
addi 9,9,1 # tmp675, i,
sldi 0,0,3 # tmp671, tmp670,
lfdx 0,4,0 #, tmp673
extsw 9,9 # i.1073, tmp675
fmul 0,0,28 # tmp674, tmp673, xarea.63
stfdx 0,26,11 #, tmp674
bdnz .L185 #
With --param max-aliased-vops=10000:
.L185:
lfd 0,0(11) #* ivtmp.886, tmp667
add 11,11,18 # ivtmp.886, ivtmp.886, D.3744
fmul 0,0,27 # tmp668, tmp667, xarea.63
stfdx 0,31,9 #* ivtmp.889, tmp668
addi 9,9,8 # ivtmp.889, ivtmp.889,
cmpd 7,9,29 # tmp787, tmp673, ivtmp.889
bne 7,.L185 #
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3/4.4 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (49 preceding siblings ...)
2008-05-01 19:27 ` pthaugen at gcc dot gnu dot org
@ 2008-05-02 10:29 ` rguenth at gcc dot gnu dot org
2008-05-02 12:17 ` rguenth at gcc dot gnu dot org
` (11 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2008-05-02 10:29 UTC (permalink / raw)
To: gcc-bugs
------- Comment #48 from rguenth at gcc dot gnu dot org 2008-05-02 10:28 -------
Btw, on x86_64 leslie3d performance is now above that from before r126326.
The differences you mention can be seen on x86_64 as well, but they are
not related to aliasing or partitioning but due to differences in what
IVOPTs produces.
The first difference at all on the tree level is with mergephi2 that is able
to remove a single forwarder BB if --param max-aliased-vops=10000 is _not_
specified. First real changes happen when DOM is able to CSE some loads
with --param max-aliased-vops=10000 but not without:
- D.1747_808 = D.1747_763;
- D.1748_809 = D.1748_764;
- D.1749_811 = D.1749_766;
- D.1750_812 = D.1749_766 * D.1644_799;
+ D.1747_808 = pav.data;
+ D.1748_809 = (real(kind=8)[0:] *) D.1747_808;
+ D.1749_811 = pav.dim[0].stride;
+ D.1750_812 = D.1644_799 * D.1749_811;
and as PRE itself is not alias-oracle aware as well further missed
optimizations
occur.
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3/4.4 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (50 preceding siblings ...)
2008-05-02 10:29 ` rguenth at gcc dot gnu dot org
@ 2008-05-02 12:17 ` rguenth at gcc dot gnu dot org
2008-05-02 12:33 ` dnovillo at google dot com
` (10 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2008-05-02 12:17 UTC (permalink / raw)
To: gcc-bugs
------- Comment #49 from rguenth at gcc dot gnu dot org 2008-05-02 12:16 -------
Missing jump-threading causes quite a number of missed FRE opportunities, we
have
if (i2 >= 0)
{
... = load X
}
if (i2 >= 0)
{
... = load X
}
where we figure out the redundancy but don't do the replacement because
the result of the first load is not available at the place of the second load.
Some pass reordering may fix this and similar problems but of course may
have other side-effects. My suggestion is to move passes from
fre, vrp, ch
to
ch, vrp, fre
and dropping the final dom pass in favor of another FRE one (DOM has
weaker memory optimization but in addition does some jump threading and
cond expr combining, both of which don't happen a lot after loop
optimizations).
But pass reordering has to wait until after the statistics patch has
been approved.
Note that teaching DOM about the alias-oracle is very difficult and
unlikely to happen - I'd rather get rid of DOM completely.
--
rguenth at gcc dot gnu dot org changed:
What |Removed |Added
----------------------------------------------------------------------------
BugsThisDependsOn| |34416, 35972
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3/4.4 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (51 preceding siblings ...)
2008-05-02 12:17 ` rguenth at gcc dot gnu dot org
@ 2008-05-02 12:33 ` dnovillo at google dot com
2008-05-02 12:56 ` rguenther at suse dot de
` (9 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: dnovillo at google dot com @ 2008-05-02 12:33 UTC (permalink / raw)
To: gcc-bugs
------- Comment #50 from dnovillo at google dot com 2008-05-02 12:32 -------
Subject: Re: [4.3/4.4 Regression] Revision
126326 causes 12% slowdown
On 05/02/08 08:16, rguenth at gcc dot gnu dot org wrote:
> and dropping the final dom pass in favor of another FRE one (DOM has
> weaker memory optimization but in addition does some jump threading and
> cond expr combining, both of which don't happen a lot after loop
> optimizations).
Along these lines, I think we would really benefit from doing a thorough
study of pass reordering using the techniques in GCC-ICI (GCC Summit
2008) and COLE (CGO 2008). Both have found significant improvements
with different optimization pipelines.
> Note that teaching DOM about the alias-oracle is very difficult and
> unlikely to happen - I'd rather get rid of DOM completely.
Agreed.
Diego.
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3/4.4 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (52 preceding siblings ...)
2008-05-02 12:33 ` dnovillo at google dot com
@ 2008-05-02 12:56 ` rguenther at suse dot de
2008-05-02 14:16 ` mmitchel at gcc dot gnu dot org
` (8 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: rguenther at suse dot de @ 2008-05-02 12:56 UTC (permalink / raw)
To: gcc-bugs
------- Comment #51 from rguenther at suse dot de 2008-05-02 12:55 -------
Subject: Re: [4.3/4.4 Regression] Revision
126326 causes 12% slowdown
On Fri, 2 May 2008, dnovillo at google dot com wrote:
> ------- Comment #50 from dnovillo at google dot com 2008-05-02 12:32 -------
> Subject: Re: [4.3/4.4 Regression] Revision
> 126326 causes 12% slowdown
>
> On 05/02/08 08:16, rguenth at gcc dot gnu dot org wrote:
>
> > and dropping the final dom pass in favor of another FRE one (DOM has
> > weaker memory optimization but in addition does some jump threading and
> > cond expr combining, both of which don't happen a lot after loop
> > optimizations).
>
> Along these lines, I think we would really benefit from doing a thorough
> study of pass reordering using the techniques in GCC-ICI (GCC Summit
> 2008) and COLE (CGO 2008). Both have found significant improvements
> with different optimization pipelines.
If somebody wants to do the legwork ... (the study probably needs to
be re-done and weight passes properly with "we like this pass" and
"we don't like this pass"). Just by visual inspection and reasoning
I can come up with some reasonable changes -- that probably expose
missed optimizations that can and should be fixed in the proper
passes rather than relying on others to cleanup (see the CCP bugs
I tackled over the last month -- I expect similar stuff in other
passes).
The question is how to go forward here. As any re-ordering
that makes sense appearantly needs thorough analysis of regressions
that happen this isn't an automatic process. Automatic processes
either require "perfect" passes or will come up with solutions that
either still regress or contain a lot more passes than we like
(in possible un-intuitive order).
Richard.
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3/4.4 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (53 preceding siblings ...)
2008-05-02 12:56 ` rguenther at suse dot de
@ 2008-05-02 14:16 ` mmitchel at gcc dot gnu dot org
2008-05-02 15:06 ` rguenther at suse dot de
` (7 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: mmitchel at gcc dot gnu dot org @ 2008-05-02 14:16 UTC (permalink / raw)
To: gcc-bugs
------- Comment #52 from mmitchel at gcc dot gnu dot org 2008-05-02 14:16 -------
Yes, the "perfect pass" problem is what concerns me too. For example, if we
try to do dynamic reordering of passes, or allow users to specify that, we have
to worry that, in practice, the compiler will crash or generate wrong code.
We'll have no good way of ever validating even a small set of the possible
combinations.
Perhaps we need to make the passes fast, so we can run them more often? Or
weave some of them together, even though of course it's nice if each pass is
logically separate and does a single thing?
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3/4.4 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (54 preceding siblings ...)
2008-05-02 14:16 ` mmitchel at gcc dot gnu dot org
@ 2008-05-02 15:06 ` rguenther at suse dot de
2008-06-06 14:59 ` rguenth at gcc dot gnu dot org
` (6 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: rguenther at suse dot de @ 2008-05-02 15:06 UTC (permalink / raw)
To: gcc-bugs
------- Comment #53 from rguenther at suse dot de 2008-05-02 15:05 -------
Subject: Re: [4.3/4.4 Regression] Revision
126326 causes 12% slowdown
On Fri, 2 May 2008, mmitchel at gcc dot gnu dot org wrote:
> ------- Comment #52 from mmitchel at gcc dot gnu dot org 2008-05-02 14:16 -------
> Yes, the "perfect pass" problem is what concerns me too. For example, if we
> try to do dynamic reordering of passes, or allow users to specify that, we have
> to worry that, in practice, the compiler will crash or generate wrong code.
> We'll have no good way of ever validating even a small set of the possible
> combinations.
True. Still if we don't have this ability to easily re-order passes
we'll never know ;)
So I still would like to have our pass-manager scripted, if it is
only for development purposes of GCC.
> Perhaps we need to make the passes fast, so we can run them more often? Or
> weave some of them together, even though of course it's nice if each pass is
> logically separate and does a single thing?
Most of the cleanup passes are fast, and one of the most important things
in my view is maintainability which helps to reduce possible wrong-code
bugs. This of course is easier with passes that do one single thing.
One thing we should consider is to keep more information around across
passes to make passes simpler. For example VRP throws away range
information - we could separate out jump threading if we didn't do that,
likewise other passes could benefit from this information. FRE and PRE
compute value numbers which are also thrown away.
Richard.
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3/4.4 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (55 preceding siblings ...)
2008-05-02 15:06 ` rguenther at suse dot de
@ 2008-06-06 14:59 ` rguenth at gcc dot gnu dot org
2008-08-27 22:04 ` jsm28 at gcc dot gnu dot org
` (5 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2008-06-06 14:59 UTC (permalink / raw)
To: gcc-bugs
------- Comment #54 from rguenth at gcc dot gnu dot org 2008-06-06 14:57 -------
4.3.1 is being released, adjusting target milestone.
--
rguenth at gcc dot gnu dot org changed:
What |Removed |Added
----------------------------------------------------------------------------
Target Milestone|4.3.1 |4.3.2
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3/4.4 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (56 preceding siblings ...)
2008-06-06 14:59 ` rguenth at gcc dot gnu dot org
@ 2008-08-27 22:04 ` jsm28 at gcc dot gnu dot org
2008-12-08 15:41 ` rguenth at gcc dot gnu dot org
` (4 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: jsm28 at gcc dot gnu dot org @ 2008-08-27 22:04 UTC (permalink / raw)
To: gcc-bugs
------- Comment #55 from jsm28 at gcc dot gnu dot org 2008-08-27 22:02 -------
4.3.2 is released, changing milestones to 4.3.3.
--
jsm28 at gcc dot gnu dot org changed:
What |Removed |Added
----------------------------------------------------------------------------
Target Milestone|4.3.2 |4.3.3
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3/4.4 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (57 preceding siblings ...)
2008-08-27 22:04 ` jsm28 at gcc dot gnu dot org
@ 2008-12-08 15:41 ` rguenth at gcc dot gnu dot org
2009-01-24 10:22 ` rguenth at gcc dot gnu dot org
` (3 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2008-12-08 15:41 UTC (permalink / raw)
To: gcc-bugs
------- Comment #56 from rguenth at gcc dot gnu dot org 2008-12-08 15:37 -------
Unassigning.
--
rguenth at gcc dot gnu dot org changed:
What |Removed |Added
----------------------------------------------------------------------------
AssignedTo|rguenth at gcc dot gnu dot |unassigned at gcc dot gnu
|org |dot org
Status|ASSIGNED |NEW
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3/4.4 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (58 preceding siblings ...)
2008-12-08 15:41 ` rguenth at gcc dot gnu dot org
@ 2009-01-24 10:22 ` rguenth at gcc dot gnu dot org
2009-01-31 14:27 ` [Bug tree-optimization/32921] [4.3 " bonzini at gnu dot org
` (2 subsequent siblings)
62 siblings, 0 replies; 64+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2009-01-24 10:22 UTC (permalink / raw)
To: gcc-bugs
------- Comment #57 from rguenth at gcc dot gnu dot org 2009-01-24 10:19 -------
GCC 4.3.3 is being released, adjusting target milestone.
--
rguenth at gcc dot gnu dot org changed:
What |Removed |Added
----------------------------------------------------------------------------
Target Milestone|4.3.3 |4.3.4
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (59 preceding siblings ...)
2009-01-24 10:22 ` rguenth at gcc dot gnu dot org
@ 2009-01-31 14:27 ` bonzini at gnu dot org
2009-01-31 14:36 ` [Bug tree-optimization/32921] [4.3/4.4 " rguenth at gcc dot gnu dot org
2009-02-02 19:17 ` pthaugen at gcc dot gnu dot org
62 siblings, 0 replies; 64+ messages in thread
From: bonzini at gnu dot org @ 2009-01-31 14:27 UTC (permalink / raw)
To: gcc-bugs
------- Comment #58 from bonzini at gnu dot org 2009-01-31 14:26 -------
> Btw, on x86_64 leslie3d performance is now above that from before r126326.
Changing to 4.3 only. A separate bug (likely not a regression) should be
opened for the testcase of comment #47.
Might even be WONTFIX for 4.3.
--
bonzini at gnu dot org changed:
What |Removed |Added
----------------------------------------------------------------------------
CC| |bonzini at gnu dot org
Known to work| |4.4.0
Summary|[4.3/4.4 Regression] |[4.3 Regression] Revision
|Revision 126326 causes 12% |126326 causes 12% slowdown
|slowdown |
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3/4.4 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (60 preceding siblings ...)
2009-01-31 14:27 ` [Bug tree-optimization/32921] [4.3 " bonzini at gnu dot org
@ 2009-01-31 14:36 ` rguenth at gcc dot gnu dot org
2009-02-02 19:17 ` pthaugen at gcc dot gnu dot org
62 siblings, 0 replies; 64+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2009-01-31 14:36 UTC (permalink / raw)
To: gcc-bugs
------- Comment #59 from rguenth at gcc dot gnu dot org 2009-01-31 14:35 -------
This is btw very likely fixed on the alias-improvements branch. SPEC testing
that for PPC would be much appreciated.
OTOH I agree with Paolo, so ... FIXED for 4.4, WONTFIX for 4.3.
--
rguenth at gcc dot gnu dot org changed:
What |Removed |Added
----------------------------------------------------------------------------
Status|NEW |RESOLVED
Known to fail|4.3.0 |4.3.0 4.3.3
Resolution| |FIXED
Summary|[4.3 Regression] Revision |[4.3/4.4 Regression]
|126326 causes 12% slowdown |Revision 126326 causes 12%
| |slowdown
Target Milestone|4.3.4 |4.4.0
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
* [Bug tree-optimization/32921] [4.3/4.4 Regression] Revision 126326 causes 12% slowdown
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
` (61 preceding siblings ...)
2009-01-31 14:36 ` [Bug tree-optimization/32921] [4.3/4.4 " rguenth at gcc dot gnu dot org
@ 2009-02-02 19:17 ` pthaugen at gcc dot gnu dot org
62 siblings, 0 replies; 64+ messages in thread
From: pthaugen at gcc dot gnu dot org @ 2009-02-02 19:17 UTC (permalink / raw)
To: gcc-bugs
------- Comment #60 from pthaugen at gcc dot gnu dot org 2009-02-02 19:16 -------
I tried leslie3d on PPC. The alias-improvements branch does indeed seem to fix
the issue. The version of leslie3d built with the alias-improvements branch is
about 10% faster than a version built with trunk, and is equivalent to a trunk
version built with --param max-aliased-vops=10000.
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921
^ permalink raw reply [flat|nested] 64+ messages in thread
end of thread, other threads:[~2009-02-02 19:17 UTC | newest]
Thread overview: 64+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2007-07-28 0:23 [Bug tree-optimization/32921] New: [4.3 Regression] Revision 126326 causes 12% slowdown hjl at lucon dot org
2007-07-28 0:32 ` [Bug tree-optimization/32921] " pinskia at gcc dot gnu dot org
2007-07-28 0:32 ` pinskia at gcc dot gnu dot org
2007-07-28 0:51 ` hjl at lucon dot org
2007-07-28 1:02 ` pinskia at gcc dot gnu dot org
2007-07-28 10:02 ` rguenth at gcc dot gnu dot org
2007-07-28 20:51 ` hjl at lucon dot org
2007-07-28 20:59 ` rguenther at suse dot de
2007-07-28 23:25 ` hjl at lucon dot org
2007-09-11 11:48 ` jsm28 at gcc dot gnu dot org
2007-10-07 15:17 ` hjl at lucon dot org
2007-10-07 15:17 ` hjl at lucon dot org
2007-10-07 15:19 ` hjl at lucon dot org
2007-10-10 17:48 ` mmitchel at gcc dot gnu dot org
2007-10-17 16:14 ` pthaugen at gcc dot gnu dot org
2007-10-17 16:19 ` pthaugen at gcc dot gnu dot org
2007-10-17 16:59 ` pinskia at gcc dot gnu dot org
2007-10-17 17:41 ` dberlin at dberlin dot org
2007-10-17 21:22 ` rguenth at gcc dot gnu dot org
2007-10-18 11:45 ` rguenth at gcc dot gnu dot org
2007-10-18 20:55 ` hjl at lucon dot org
2007-10-19 11:26 ` rguenth at gcc dot gnu dot org
2007-10-19 12:28 ` rguenth at gcc dot gnu dot org
2007-10-19 12:40 ` rguenth at gcc dot gnu dot org
2007-10-19 15:36 ` rguenth at gcc dot gnu dot org
2007-10-19 16:28 ` rguenth at gcc dot gnu dot org
2007-10-19 22:13 ` hjl at lucon dot org
2007-10-20 4:21 ` patchapp at dberlin dot org
2007-10-20 9:39 ` rguenth at gcc dot gnu dot org
2007-10-20 16:05 ` hjl at lucon dot org
2007-10-22 21:11 ` pthaugen at gcc dot gnu dot org
2007-10-23 2:23 ` Joey dot ye at intel dot com
2007-10-23 20:29 ` pthaugen at gcc dot gnu dot org
2007-10-23 20:31 ` pthaugen at gcc dot gnu dot org
2007-10-24 12:23 ` rguenth at gcc dot gnu dot org
2007-10-24 15:02 ` rguenth at gcc dot gnu dot org
2007-10-24 15:18 ` rguenth at gcc dot gnu dot org
2007-10-24 15:19 ` rguenth at gcc dot gnu dot org
2007-10-24 23:15 ` pthaugen at gcc dot gnu dot org
2007-10-25 8:55 ` rguenth at gcc dot gnu dot org
2007-10-25 10:06 ` pinskia at gcc dot gnu dot org
2007-10-25 11:02 ` rguenth at gcc dot gnu dot org
2007-10-25 11:07 ` rguenth at gcc dot gnu dot org
2008-03-14 16:53 ` [Bug tree-optimization/32921] [4.3/4.4 " rguenth at gcc dot gnu dot org
2008-04-23 14:10 ` rguenth at gcc dot gnu dot org
2008-04-23 14:10 ` rguenth at gcc dot gnu dot org
2008-04-30 18:50 ` pthaugen at gcc dot gnu dot org
2008-04-30 20:30 ` rguenth at gcc dot gnu dot org
2008-04-30 21:43 ` rguenth at gcc dot gnu dot org
2008-04-30 22:36 ` pthaugen at gcc dot gnu dot org
2008-05-01 19:27 ` pthaugen at gcc dot gnu dot org
2008-05-02 10:29 ` rguenth at gcc dot gnu dot org
2008-05-02 12:17 ` rguenth at gcc dot gnu dot org
2008-05-02 12:33 ` dnovillo at google dot com
2008-05-02 12:56 ` rguenther at suse dot de
2008-05-02 14:16 ` mmitchel at gcc dot gnu dot org
2008-05-02 15:06 ` rguenther at suse dot de
2008-06-06 14:59 ` rguenth at gcc dot gnu dot org
2008-08-27 22:04 ` jsm28 at gcc dot gnu dot org
2008-12-08 15:41 ` rguenth at gcc dot gnu dot org
2009-01-24 10:22 ` rguenth at gcc dot gnu dot org
2009-01-31 14:27 ` [Bug tree-optimization/32921] [4.3 " bonzini at gnu dot org
2009-01-31 14:36 ` [Bug tree-optimization/32921] [4.3/4.4 " rguenth at gcc dot gnu dot org
2009-02-02 19:17 ` pthaugen at gcc dot gnu dot org
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).