[Bug target/43703] New: Unexpected floating point precision loss due to ARM NEON autovectorization

public inbox for gcc-bugs@sourceware.org
help / color / mirror / Atom feed

* [Bug target/43703]  New: Unexpected floating point precision loss due to ARM NEON autovectorization
@ 2010-04-09 14:34 siarhei dot siamashka at gmail dot com
  2010-04-09 19:55 ` [Bug target/43703] " pinskia at gcc dot gnu dot org
                   ` (7 more replies)
  0 siblings, 8 replies; 10+ messages in thread
From: siarhei dot siamashka at gmail dot com @ 2010-04-09 14:34 UTC (permalink / raw)
  To: gcc-bugs

Using gcc-4.5.0-RC-20100406.tar.bz2

/************************************************************/
#include <stdio.h>

void __attribute__((noinline)) f(float * __restrict c,
                                 float * __restrict a,
                                 float * __restrict b)
{
    int i;
    for (i = 0; i < 4; i++) {
        c[i] = a[i] * b[i];
    }
}

int main()
{
    float a[4], b[4], c[4];

    a[0] = 1e-40;
    b[0] = 1e+38;

    f(c, a, b);

    printf("c[0]=%f\n", (double)c[0]);
    if (c[0] < 0.001)
        printf("precision problem: c[0] was flushed to zero\n");

    return 0;
}
/************************************************************/

# gcc -mcpu=cortex-a8 -mfloat-abi=softfp -mfpu=neon -O2 -fno-fast-math test.c
# ./a.out
c[0]=0.010000

# gcc -mcpu=cortex-a8 -mfloat-abi=softfp -mfpu=neon -O3 -fno-fast-math test.c
# ./a.out
c[0]=0.000000
precision problem: c[0] was flushed to zero


Using -O3 option turns on autovectorization, and the results of operations
involving denormals get flushed to zero. This happens even if no "-ffast-math"
or any other precision sacrificing options are enabled.


-- 
           Summary: Unexpected floating point precision loss due to ARM NEON
                    autovectorization
           Product: gcc
           Version: 4.5.0
            Status: UNCONFIRMED
          Severity: normal
          Priority: P3
         Component: target
        AssignedTo: unassigned at gcc dot gnu dot org
        ReportedBy: siarhei dot siamashka at gmail dot com
 GCC build triplet: armv7l-unknown-linux-gnueabi
  GCC host triplet: armv7l-unknown-linux-gnueabi
GCC target triplet: armv7l-unknown-linux-gnueabi


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=43703


^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug target/43703] Unexpected floating point precision loss due to ARM NEON autovectorization
  2010-04-09 14:34 [Bug target/43703] New: Unexpected floating point precision loss due to ARM NEON autovectorization siarhei dot siamashka at gmail dot com
@ 2010-04-09 19:55 ` pinskia at gcc dot gnu dot org
  2010-04-09 20:34 ` siarhei dot siamashka at gmail dot com
                   ` (6 subsequent siblings)
  7 siblings, 0 replies; 10+ messages in thread
From: pinskia at gcc dot gnu dot org @ 2010-04-09 19:55 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #1 from pinskia at gcc dot gnu dot org  2010-04-09 19:55 -------
This is exacted really.  Denormals are a weird case in general.  Plus your
testcase depends on uninitialized values.


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=43703


^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug target/43703] Unexpected floating point precision loss due to ARM NEON autovectorization
  2010-04-09 14:34 [Bug target/43703] New: Unexpected floating point precision loss due to ARM NEON autovectorization siarhei dot siamashka at gmail dot com
  2010-04-09 19:55 ` [Bug target/43703] " pinskia at gcc dot gnu dot org
@ 2010-04-09 20:34 ` siarhei dot siamashka at gmail dot com
  2010-04-12  9:17 ` ramana at gcc dot gnu dot org
                   ` (5 subsequent siblings)
  7 siblings, 0 replies; 10+ messages in thread
From: siarhei dot siamashka at gmail dot com @ 2010-04-09 20:34 UTC (permalink / raw)
  To: gcc-bugs

------- Comment #2 from siarhei dot siamashka at gmail dot com  2010-04-09 20:34 -------
(In reply to comment #1)
> This is exacted really.  Denormals are a weird case in general.

Well, denormals may be weird. But what about nan's, inf's and the other IEEE
stuff, which is not supported by NEON unit? The compiler here takes the liberty
of using NEON whenever it likes, and NEON does not fully support IEEE for sure.
After reading man gcc, I had an impression that this should have been
controlled by -ffast-math and the related options.

Floating point performance of VFP Lite unit is a disaster, and using NEON where
appropriate is definitely needed. But IMHO this should be controlled somehow.
For example by selectively using pragma optimize to set -ffast-math option in
the critical parts of code.

Also I don't know how fantastic it is, but having a special data type,
something like 'fast_float' with the relaxed precision requirements and
suitable for use with NEON would be really nice.

> Plus your testcase depends on uninitialized values.

Yes, the testcase is not quite clean, but is easily fixable. Though this should
not cause any problems unless floating point exceptions are enabled, those
extra values are just irrelevant. Should I post an updated testcase?

-- 

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=43703

^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug target/43703] Unexpected floating point precision loss due to ARM NEON autovectorization
  2010-04-09 14:34 [Bug target/43703] New: Unexpected floating point precision loss due to ARM NEON autovectorization siarhei dot siamashka at gmail dot com
  2010-04-09 19:55 ` [Bug target/43703] " pinskia at gcc dot gnu dot org
  2010-04-09 20:34 ` siarhei dot siamashka at gmail dot com
@ 2010-04-12  9:17 ` ramana at gcc dot gnu dot org
  2010-06-15 10:35 ` siarhei dot siamashka at gmail dot com
                   ` (4 subsequent siblings)
  7 siblings, 0 replies; 10+ messages in thread
From: ramana at gcc dot gnu dot org @ 2010-04-12  9:17 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #3 from ramana at gcc dot gnu dot org  2010-04-12 09:17 -------
Could you post a cleaned-up testcase ? I tried a cleaned up testcase with the
values appropriately zero-initialized and gcc ends up generating the vectorized
value in this case. 


-- 

ramana at gcc dot gnu dot org changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|UNCONFIRMED                 |NEW
     Ever Confirmed|0                           |1
           Keywords|                            |wrong-code
      Known to fail|                            |4.4.3 4.5.0
   Last reconfirmed|0000-00-00 00:00:00         |2010-04-12 09:17:11
               date|                            |


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=43703


^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug target/43703] Unexpected floating point precision loss due to ARM NEON autovectorization
  2010-04-09 14:34 [Bug target/43703] New: Unexpected floating point precision loss due to ARM NEON autovectorization siarhei dot siamashka at gmail dot com
                   ` (2 preceding siblings ...)
  2010-04-12  9:17 ` ramana at gcc dot gnu dot org
@ 2010-06-15 10:35 ` siarhei dot siamashka at gmail dot com
  2010-06-16 11:42 ` jules at gcc dot gnu dot org
                   ` (3 subsequent siblings)
  7 siblings, 0 replies; 10+ messages in thread
From: siarhei dot siamashka at gmail dot com @ 2010-06-15 10:35 UTC (permalink / raw)
  To: gcc-bugs

------- Comment #4 from siarhei dot siamashka at gmail dot com  2010-06-15 10:34 -------
Created an attachment (id=20913)
 --> (http://gcc.gnu.org/bugzilla/attachment.cgi?id=20913&action=view)
a fixed testcase

A fixed testcase attached.

The main problem here is that denormals are not handled in a 'civilized' way by
gcc at the moment. They are just silently and unconditionally treated in a
relaxed way, and that might be neither wanted nor expected by the user. And
'readelf -A' shows the following EABI tags for the generated object file, even
not marking it in a special way with the regards to denormals handling:
  Tag_ABI_FP_denormal: Needed
  Tag_ABI_FP_exceptions: Needed
  Tag_ABI_FP_number_model: IEEE 754

-- 

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=43703

^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug target/43703] Unexpected floating point precision loss due to ARM NEON autovectorization
  2010-04-09 14:34 [Bug target/43703] New: Unexpected floating point precision loss due to ARM NEON autovectorization siarhei dot siamashka at gmail dot com
                   ` (3 preceding siblings ...)
  2010-06-15 10:35 ` siarhei dot siamashka at gmail dot com
@ 2010-06-16 11:42 ` jules at gcc dot gnu dot org
  2010-06-22  1:55 ` sandra at codesourcery dot com
                   ` (2 subsequent siblings)
  7 siblings, 0 replies; 10+ messages in thread
From: jules at gcc dot gnu dot org @ 2010-06-16 11:42 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #5 from jules at gcc dot gnu dot org  2010-06-16 11:41 -------
I am working on this.


-- 

jules at gcc dot gnu dot org changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
         AssignedTo|unassigned at gcc dot gnu   |jules at gcc dot gnu dot org
                   |dot org                     |
             Status|NEW                         |ASSIGNED
   Last reconfirmed|2010-04-12 09:17:11         |2010-06-16 11:41:32
               date|                            |


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=43703


^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug target/43703] Unexpected floating point precision loss due to ARM NEON autovectorization
  2010-04-09 14:34 [Bug target/43703] New: Unexpected floating point precision loss due to ARM NEON autovectorization siarhei dot siamashka at gmail dot com
                   ` (4 preceding siblings ...)
  2010-06-16 11:42 ` jules at gcc dot gnu dot org
@ 2010-06-22  1:55 ` sandra at codesourcery dot com
  2010-07-03  0:47 ` sandra at gcc dot gnu dot org
  2010-07-05 12:22 ` ramana at gcc dot gnu dot org
  7 siblings, 0 replies; 10+ messages in thread
From: sandra at codesourcery dot com @ 2010-06-22  1:55 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #6 from sandra at codesourcery dot com  2010-06-22 01:55 -------
Julian's patch overlapped some other NEON changes I was already preparing for
submission, so I did some refactoring before posting it for review.  Here's the
main part of the fix:

http://gcc.gnu.org/ml/gcc-patches/2010-06/msg02102.html


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=43703


^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug target/43703] Unexpected floating point precision loss due to ARM NEON autovectorization
  2010-04-09 14:34 [Bug target/43703] New: Unexpected floating point precision loss due to ARM NEON autovectorization siarhei dot siamashka at gmail dot com
                   ` (5 preceding siblings ...)
  2010-06-22  1:55 ` sandra at codesourcery dot com
@ 2010-07-03  0:47 ` sandra at gcc dot gnu dot org
  2010-07-05 12:22 ` ramana at gcc dot gnu dot org
  7 siblings, 0 replies; 10+ messages in thread
From: sandra at gcc dot gnu dot org @ 2010-07-03  0:47 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #7 from sandra at gcc dot gnu dot org  2010-07-03 00:47 -------
Subject: Bug 43703

Author: sandra
Date: Sat Jul  3 00:46:51 2010
New Revision: 161763

URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=161763
Log:
2010-07-02  Julian Brown  <julian@codesourcery.com>
            Sandra Loosemore <sandra@codesourcery.com>

        PR target/43703

        gcc/
        * config/arm/vec-common.md (add<mode>3, sub<mode>3, smin<mode>3)
        (smax<mode>3): Disable for NEON float modes when
        flag_unsafe_math_optimizations is false.
        * config/arm/neon.md (*add<mode>3_neon, *sub<mode>3_neon)
        (*mul<mode>3_neon)
        (mul<mode>3add<mode>_neon, mul<mode>3neg<mode>add<mode>_neon)
        (reduc_splus_<mode>, reduc_smin_<mode>, reduc_smax_<mode>): Disable
        for NEON float modes when flag_unsafe_math_optimizations is false.
        (quad_halves_<code>v4sf): Only enable if flag_unsafe_math_optimizations
        is true.
        * doc/invoke.texi (ARM Options): Add note about floating point
        vectorization requiring -funsafe-math-optimizations.

        gcc/testsuite/
        * gcc.dg/vect/vect.exp: Add -ffast-math for NEON.
        * gcc.dg/vect/vect-reduc-6.c: Add XFAIL for NEON.



Modified:
    trunk/gcc/ChangeLog
    trunk/gcc/config/arm/neon.md
    trunk/gcc/config/arm/vec-common.md
    trunk/gcc/doc/invoke.texi
    trunk/gcc/testsuite/ChangeLog
    trunk/gcc/testsuite/gcc.dg/vect/vect-reduc-6.c
    trunk/gcc/testsuite/gcc.dg/vect/vect.exp


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=43703


^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug target/43703] Unexpected floating point precision loss due to ARM NEON autovectorization
  2010-04-09 14:34 [Bug target/43703] New: Unexpected floating point precision loss due to ARM NEON autovectorization siarhei dot siamashka at gmail dot com
                   ` (6 preceding siblings ...)
  2010-07-03  0:47 ` sandra at gcc dot gnu dot org
@ 2010-07-05 12:22 ` ramana at gcc dot gnu dot org
  7 siblings, 0 replies; 10+ messages in thread
From: ramana at gcc dot gnu dot org @ 2010-07-05 12:22 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #8 from ramana at gcc dot gnu dot org  2010-07-05 12:22 -------
Are you planning to backport this to all release branches since this affects
all of them ?

cheers
Ramana


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=43703


^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug target/43703] Unexpected floating point precision loss due to ARM NEON autovectorization
       [not found] <bug-43703-4@http.gcc.gnu.org/bugzilla/>
@ 2012-07-10 13:23 ` rearnsha at gcc dot gnu.org
  0 siblings, 0 replies; 10+ messages in thread
From: rearnsha at gcc dot gnu.org @ 2012-07-10 13:23 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=43703

Richard Earnshaw <rearnsha at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|ASSIGNED                    |RESOLVED
         Resolution|                            |FIXED
   Target Milestone|---                         |4.6.0
      Known to fail|                            |

--- Comment #9 from Richard Earnshaw <rearnsha at gcc dot gnu.org> 2012-07-10 13:22:33 UTC ---
Fixed in 4.6.0.  4.5 is no-longer being maintained.


^ permalink raw reply	[flat|nested] 10+ messages in thread

end of thread, other threads:[~2012-07-10 13:23 UTC | newest]

Thread overview: 10+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2010-04-09 14:34 [Bug target/43703] New: Unexpected floating point precision loss due to ARM NEON autovectorization siarhei dot siamashka at gmail dot com
2010-04-09 19:55 ` [Bug target/43703] " pinskia at gcc dot gnu dot org
2010-04-09 20:34 ` siarhei dot siamashka at gmail dot com
2010-04-12  9:17 ` ramana at gcc dot gnu dot org
2010-06-15 10:35 ` siarhei dot siamashka at gmail dot com
2010-06-16 11:42 ` jules at gcc dot gnu dot org
2010-06-22  1:55 ` sandra at codesourcery dot com
2010-07-03  0:47 ` sandra at gcc dot gnu dot org
2010-07-05 12:22 ` ramana at gcc dot gnu dot org
     [not found] <bug-43703-4@http.gcc.gnu.org/bugzilla/>
2012-07-10 13:23 ` rearnsha at gcc dot gnu.org

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).