public inbox for gcc-bugs@sourceware.org
help / color / mirror / Atom feed
* [Bug c++/35585]  New: Miscompiled inline assembly
@ 2008-03-14 16:25 nicos at maunakeatech dot com
  2008-03-14 16:27 ` [Bug c++/35585] " nicos at maunakeatech dot com
                   ` (8 more replies)
  0 siblings, 9 replies; 10+ messages in thread
From: nicos at maunakeatech dot com @ 2008-03-14 16:25 UTC (permalink / raw)
  To: gcc-bugs

The project I work on uses an inline assembly part to compute floor and ceil
functions of floating point numbers and its seems that in some cases, with
gcc-4.2 and optimizations turned on, the computed values are not correct. I
attached the smallest testcase that I could come up with to reproduce the error
and the preprocessed input. Changing the testcase a little suffices to make the
miscompilation disappear.
Since I am not an expert on inline assembly in gcc, I am not sure that the
iCeil/iFloor functions are completely correct and perhaps this is not a bug...

I was unable to reproduce the error with gcc-4.1 and gcc-4.3. The error is
still present with gcc-4.2.3. The error only appears with -O2/-O3.

Compilation command:
g++-4.2 -v -save-temps testcase.cpp -o testcase -O3
Using built-in specs.
Target: i486-linux-gnu
Configured with: ../src/configure -v
--enable-languages=c,c++,fortran,objc,obj-c++,treelang --prefix=/usr
--enable-shared --with-system-zlib --libexecdir=/usr/lib
--without-included-gettext --enable-threads=posix --enable-nls
--with-gxx-include-dir=/usr/include/c++/4.2 --program-suffix=-4.2
--enable-clocale=gnu --enable-libstdcxx-debug --enable-mpfr
--enable-targets=all --enable-checking=release --build=i486-linux-gnu
--host=i486-linux-gnu --target=i486-linux-gnu
Thread model: posix
gcc version 4.2.1 (Ubuntu 4.2.1-5ubuntu4)
 /usr/lib/gcc/i486-linux-gnu/4.2.1/cc1plus -E -quiet -v -D_GNU_SOURCE
testcase.cpp -mtune=generic -O3 -fpch-preprocess -o testcase.ii
ignoring nonexistent directory "/usr/local/include/i486-linux-gnu"
ignoring nonexistent directory
"/usr/lib/gcc/i486-linux-gnu/4.2.1/../../../../i486-linux-gnu/include"
ignoring nonexistent directory "/usr/include/i486-linux-gnu"
#include "..." search starts here:
#include <...> search starts here:
 /usr/include/c++/4.2
 /usr/include/c++/4.2/i486-linux-gnu
 /usr/include/c++/4.2/backward
 /usr/local/include
 /usr/lib/gcc/i486-linux-gnu/4.2.1/include
 /usr/include
End of search list.
 /usr/lib/gcc/i486-linux-gnu/4.2.1/cc1plus -fpreprocessed testcase.ii -quiet
-dumpbase testcase.cpp -mtune=generic -auxbase testcase -O3 -version
-fstack-protector -fstack-protector -o testcase.s
GNU C++ version 4.2.1 (Ubuntu 4.2.1-5ubuntu4) (i486-linux-gnu)
        compiled by GNU C version 4.2.1 (Ubuntu 4.2.1-5ubuntu4).
GGC heuristics: --param ggc-min-expand=100 --param ggc-min-heapsize=131072
Compiler executable checksum: 44e55ae5d2724830dee11801424b84d8
 as --traditional-format -V -Qy -o testcase.o testcase.s
GNU assembler version 2.18 (i486-linux-gnu) using BFD version (GNU Binutils for
Ubuntu) 2.18
 /usr/lib/gcc/i486-linux-gnu/4.2.1/collect2 --eh-frame-hdr -m elf_i386
--hash-style=both -dynamic-linker /lib/ld-linux.so.2 -o testcase
/usr/lib/gcc/i486-linux-gnu/4.2.1/../../../../lib/crt1.o
/usr/lib/gcc/i486-linux-gnu/4.2.1/../../../../lib/crti.o
/usr/lib/gcc/i486-linux-gnu/4.2.1/crtbegin.o
-L/usr/lib/gcc/i486-linux-gnu/4.2.1 -L/usr/lib/gcc/i486-linux-gnu/4.2.1
-L/usr/lib/gcc/i486-linux-gnu/4.2.1/../../../../lib -L/lib/../lib
-L/usr/lib/../lib -L/usr/lib/gcc/i486-linux-gnu/4.2.1/../../.. testcase.o
-lstdc++ -lm -lgcc_s -lgcc -lc -lgcc_s -lgcc
/usr/lib/gcc/i486-linux-gnu/4.2.1/crtend.o
/usr/lib/gcc/i486-linux-gnu/4.2.1/../../../../lib/crtn.o


-- 
           Summary: Miscompiled inline assembly
           Product: gcc
           Version: 4.2.1
            Status: UNCONFIRMED
          Severity: normal
          Priority: P3
         Component: c++
        AssignedTo: unassigned at gcc dot gnu dot org
        ReportedBy: nicos at maunakeatech dot com
GCC target triplet: i486-linux-gnu


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=35585


^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug c++/35585] Miscompiled inline assembly
  2008-03-14 16:25 [Bug c++/35585] New: Miscompiled inline assembly nicos at maunakeatech dot com
@ 2008-03-14 16:27 ` nicos at maunakeatech dot com
  2008-03-14 16:29 ` nicos at maunakeatech dot com
                   ` (7 subsequent siblings)
  8 siblings, 0 replies; 10+ messages in thread
From: nicos at maunakeatech dot com @ 2008-03-14 16:27 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #1 from nicos at maunakeatech dot com  2008-03-14 16:26 -------
Created an attachment (id=15317)
 --> (http://gcc.gnu.org/bugzilla/attachment.cgi?id=15317&action=view)
Testcase to reproduce the bug


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=35585


^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug c++/35585] Miscompiled inline assembly
  2008-03-14 16:25 [Bug c++/35585] New: Miscompiled inline assembly nicos at maunakeatech dot com
  2008-03-14 16:27 ` [Bug c++/35585] " nicos at maunakeatech dot com
@ 2008-03-14 16:29 ` nicos at maunakeatech dot com
  2008-03-14 17:31 ` rguenth at gcc dot gnu dot org
                   ` (6 subsequent siblings)
  8 siblings, 0 replies; 10+ messages in thread
From: nicos at maunakeatech dot com @ 2008-03-14 16:29 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #2 from nicos at maunakeatech dot com  2008-03-14 16:29 -------
Created an attachment (id=15318)
 --> (http://gcc.gnu.org/bugzilla/attachment.cgi?id=15318&action=view)
Preprocessed testcase


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=35585


^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug c++/35585] Miscompiled inline assembly
  2008-03-14 16:25 [Bug c++/35585] New: Miscompiled inline assembly nicos at maunakeatech dot com
  2008-03-14 16:27 ` [Bug c++/35585] " nicos at maunakeatech dot com
  2008-03-14 16:29 ` nicos at maunakeatech dot com
@ 2008-03-14 17:31 ` rguenth at gcc dot gnu dot org
  2008-03-15  1:08 ` nicos at maunakeatech dot com
                   ` (5 subsequent siblings)
  8 siblings, 0 replies; 10+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2008-03-14 17:31 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #3 from rguenth at gcc dot gnu dot org  2008-03-14 17:30 -------
This is caused by extra precision on the 387 FPU.

*** This bug has been marked as a duplicate of 323 ***


-- 

rguenth at gcc dot gnu dot org changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|UNCONFIRMED                 |RESOLVED
         Resolution|                            |DUPLICATE


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=35585


^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug c++/35585] Miscompiled inline assembly
  2008-03-14 16:25 [Bug c++/35585] New: Miscompiled inline assembly nicos at maunakeatech dot com
                   ` (2 preceding siblings ...)
  2008-03-14 17:31 ` rguenth at gcc dot gnu dot org
@ 2008-03-15  1:08 ` nicos at maunakeatech dot com
  2008-03-15 11:29 ` [Bug tree-optimization/35585] [4.2 Regression] " rguenth at gcc dot gnu dot org
                   ` (4 subsequent siblings)
  8 siblings, 0 replies; 10+ messages in thread
From: nicos at maunakeatech dot com @ 2008-03-15  1:08 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #4 from nicos at maunakeatech dot com  2008-03-15 01:07 -------
I think I need some help here, I looked to bug 323 and I can't see how it is
related to this issue.
The assertion at the end of the test case compares integers, and the iFloor
function is only applied to 0 in the test. My problem is that the result in
sp.bbox[0] seems total garbage.
I looked at the assembly code generated by gcc (-S flag), the portion
corresponding to:

sp.bbox[0] = std::min(sp.bbox[0], iFloor(txcum) );

is:

.L201:
        fld     %st(1)
        fadd    %st(2), %st
        fsubs   .LC4
#APP
        fistpl -44(%ebp)
#NO_APP
        movl    -44(%ebp), %edi
        movl    -88(%ebp), %eax
        sarl    %edi
        cmpl    16(%ebx), %edi
        jge     .L204
        leal    -28(%ebp), %eax
.L204:
        fld     %st(0)
        movl    (%eax), %eax
        fadd    %st(1), %st
        fsubs   .LC4
        movl    %eax, 16(%ebx)

The result of iFloor is stored in %edi but whatever the result of cmpl, the
value in %edi isn't used (instead the value at -28(%ebp), which appears to be
uninitialized, is stored into sp.bbox[0]).


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=35585


^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug tree-optimization/35585] [4.2 Regression] Miscompiled inline assembly
  2008-03-14 16:25 [Bug c++/35585] New: Miscompiled inline assembly nicos at maunakeatech dot com
                   ` (3 preceding siblings ...)
  2008-03-15  1:08 ` nicos at maunakeatech dot com
@ 2008-03-15 11:29 ` rguenth at gcc dot gnu dot org
  2008-03-15 11:34 ` rguenth at gcc dot gnu dot org
                   ` (3 subsequent siblings)
  8 siblings, 0 replies; 10+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2008-03-15 11:29 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #5 from rguenth at gcc dot gnu dot org  2008-03-15 11:28 -------
Sorry.  The tree optimizers produce

<L62>:;
  __asm__ __volatile__("fistl %0":"=m" r:"t" txcum * 2.0e+0 - 5.0e-1);
  D.16879 = r >> 1;
  if (D.16879 < sp->bbox[0]) goto <L105>; else goto <L17>;

<L105>:;
  __b = &D.12083;
  goto <bb 20> (<L18>);

<L17>:;
  __b = &sp->bbox[0];

<L18>:;
  sp->bbox[0] = *__b;

which looks suspicious as well, as D.12083 is not the correct result here
(but in fact is uninitialized).  And this is store-sinking which makes
a mess of it:

Sinking #   D.12083_947 = V_MUST_DEF <D.12083_380>;
D.12083 = D.16879_333 from bb 21 to bb 52

because of wrong alias information computed right before this pass:

<L62>:;
  D.16874_329 = txcum_285 * 2.0e+0;
  x_330 = D.16874_329 - 5.0e-1;
  #   r_946 = V_MAY_DEF <r_288>;
  __asm__ __volatile__("fistl %0":"=m" r:"t" x_330);
  #   VUSE <r_946>;
  r.41_332 = r;
  D.16879_333 = r.41_332 >> 1;
  #   D.12083_947 = V_MUST_DEF <D.12083_380>;
  D.12083 = D.16879_333;
  #   VUSE <SFT.738_136>;
  #   VUSE <SFT.739_451>;
  #   VUSE <SFT.740_367>;
  D.16880_340 = sp_119->bbox[0];
  if (D.16879_333 < D.16880_340) goto <L91>; else goto <L17>;

<L91>:;
  goto <bb 23> (<L18>);

<L17>:;
  __b_342 = &sp_119->bbox[0];

  # __b_5 = PHI <&D.12083(48), __b_342(22)>;
<L18>:;
  #   VUSE <r_946>;
  #   VUSE <r_284>;
  #   VUSE <r_6>;
  #   VUSE <r_43>;
  D.12120_344 = *__b_5;


Oh well, it's not that 4.2 does not have known aliasing related problems.


-- 

rguenth at gcc dot gnu dot org changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|RESOLVED                    |UNCONFIRMED
          Component|c++                         |tree-optimization
           Keywords|                            |alias, wrong-code
           Priority|P3                          |P2
         Resolution|DUPLICATE                   |
            Summary|Miscompiled inline assembly |[4.2 Regression] Miscompiled
                   |                            |inline assembly
   Target Milestone|---                         |4.2.4


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=35585


^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug tree-optimization/35585] [4.2 Regression] Miscompiled inline assembly
  2008-03-14 16:25 [Bug c++/35585] New: Miscompiled inline assembly nicos at maunakeatech dot com
                   ` (4 preceding siblings ...)
  2008-03-15 11:29 ` [Bug tree-optimization/35585] [4.2 Regression] " rguenth at gcc dot gnu dot org
@ 2008-03-15 11:34 ` rguenth at gcc dot gnu dot org
  2008-03-15 11:52 ` rguenth at gcc dot gnu dot org
                   ` (2 subsequent siblings)
  8 siblings, 0 replies; 10+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2008-03-15 11:34 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #6 from rguenth at gcc dot gnu dot org  2008-03-15 11:33 -------
In fact it's completely wrong.

  # __b_5 = PHI <&D.12083(48), __b_342(22)>;
<L18>:;
  #   VUSE <r_946>;
  #   VUSE <r_284>;
  #   VUSE <r_6>;
  #   VUSE <r_43>;
  D.12120_344 = *__b_5;

should be

  # __b_5 = PHI <&D.12083(48), __b_342(22)>;
<L18>:;
  #   VUSE <SFT.738_136>;
  #   VUSE <SFT.739_451>;
  #   VUSE <SFT.740_367>;
  #   VUSE <D.12083_947>;
  D.12120_344 = *__b_5;

This looks like a const vs. non-const issue (which I vaguely remember).


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=35585


^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug tree-optimization/35585] [4.2 Regression] Miscompiled inline assembly
  2008-03-14 16:25 [Bug c++/35585] New: Miscompiled inline assembly nicos at maunakeatech dot com
                   ` (5 preceding siblings ...)
  2008-03-15 11:34 ` rguenth at gcc dot gnu dot org
@ 2008-03-15 11:52 ` rguenth at gcc dot gnu dot org
  2008-05-19 20:34 ` jsm28 at gcc dot gnu dot org
  2009-03-31 15:26 ` jsm28 at gcc dot gnu dot org
  8 siblings, 0 replies; 10+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2008-03-15 11:52 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #7 from rguenth at gcc dot gnu dot org  2008-03-15 11:51 -------
points-to works well and ends up with __b_5 pointing to anything, so we fall
back to use SMTs which in this case is (for const int& __b):

SMT.761, UID 18425, const int, is addressable, is global, call clobbered, may
aliases: { r r r r }

there you go.  It should also (at least) alias

D.12083, UID 12083, int, is aliased, is addressable, call clobbered, default
def: D.12083_242

SMT.763, UID 18427, struct SceneProps, is addressable, is global, call
clobbered, default def: SMT.763_469, may aliases: { SFT.738 SFT.739 SFT.740 r r
r r D.12089 D.12083 D.12084 D.12088 }

so it is flow-insensitive alias analysis that gets it wrong.


-- 

rguenth at gcc dot gnu dot org changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|UNCONFIRMED                 |NEW
     Ever Confirmed|0                           |1
   Last reconfirmed|0000-00-00 00:00:00         |2008-03-15 11:51:54
               date|                            |


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=35585


^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug tree-optimization/35585] [4.2 Regression] Miscompiled inline assembly
  2008-03-14 16:25 [Bug c++/35585] New: Miscompiled inline assembly nicos at maunakeatech dot com
                   ` (6 preceding siblings ...)
  2008-03-15 11:52 ` rguenth at gcc dot gnu dot org
@ 2008-05-19 20:34 ` jsm28 at gcc dot gnu dot org
  2009-03-31 15:26 ` jsm28 at gcc dot gnu dot org
  8 siblings, 0 replies; 10+ messages in thread
From: jsm28 at gcc dot gnu dot org @ 2008-05-19 20:34 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #8 from jsm28 at gcc dot gnu dot org  2008-05-19 20:25 -------
4.2.4 is being released, changing milestones to 4.2.5.


-- 

jsm28 at gcc dot gnu dot org changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
   Target Milestone|4.2.4                       |4.2.5


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=35585


^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug tree-optimization/35585] [4.2 Regression] Miscompiled inline assembly
  2008-03-14 16:25 [Bug c++/35585] New: Miscompiled inline assembly nicos at maunakeatech dot com
                   ` (7 preceding siblings ...)
  2008-05-19 20:34 ` jsm28 at gcc dot gnu dot org
@ 2009-03-31 15:26 ` jsm28 at gcc dot gnu dot org
  8 siblings, 0 replies; 10+ messages in thread
From: jsm28 at gcc dot gnu dot org @ 2009-03-31 15:26 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #9 from jsm28 at gcc dot gnu dot org  2009-03-31 15:25 -------
Closing 4.2 branch, fixed for 4.3.


-- 

jsm28 at gcc dot gnu dot org changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |RESOLVED
      Known to fail|4.2.1 4.2.3                 |4.2.1 4.2.3 4.2.5
         Resolution|                            |FIXED
   Target Milestone|4.2.5                       |4.3.0


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=35585


^ permalink raw reply	[flat|nested] 10+ messages in thread

end of thread, other threads:[~2009-03-31 15:26 UTC | newest]

Thread overview: 10+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2008-03-14 16:25 [Bug c++/35585] New: Miscompiled inline assembly nicos at maunakeatech dot com
2008-03-14 16:27 ` [Bug c++/35585] " nicos at maunakeatech dot com
2008-03-14 16:29 ` nicos at maunakeatech dot com
2008-03-14 17:31 ` rguenth at gcc dot gnu dot org
2008-03-15  1:08 ` nicos at maunakeatech dot com
2008-03-15 11:29 ` [Bug tree-optimization/35585] [4.2 Regression] " rguenth at gcc dot gnu dot org
2008-03-15 11:34 ` rguenth at gcc dot gnu dot org
2008-03-15 11:52 ` rguenth at gcc dot gnu dot org
2008-05-19 20:34 ` jsm28 at gcc dot gnu dot org
2009-03-31 15:26 ` jsm28 at gcc dot gnu dot org

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).