[Bug libgcc/113155] New: large overhead for cast float to uint64_t. Arm cortex-m4 (ARMv7E-M, fpv4-sp-d16, ieee 754). Compiler: arm-none-eabi-gcc

public inbox for gcc-bugs@sourceware.org
help / color / mirror / Atom feed

* [Bug libgcc/113155] New: large overhead for cast float to uint64_t. Arm cortex-m4 (ARMv7E-M, fpv4-sp-d16, ieee 754). Compiler: arm-none-eabi-gcc
@ 2023-12-27  7:03 kirdyankinsp at gmail dot com
  2023-12-27  7:17 ` [Bug target/113155] " pinskia at gcc dot gnu.org
                   ` (6 more replies)
  0 siblings, 7 replies; 8+ messages in thread
From: kirdyankinsp at gmail dot com @ 2023-12-27  7:03 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113155

            Bug ID: 113155
           Summary: large overhead for cast float to uint64_t. Arm
                    cortex-m4 (ARMv7E-M, fpv4-sp-d16, ieee 754). Compiler:
                    arm-none-eabi-gcc
           Product: gcc
           Version: unknown
            Status: UNCONFIRMED
          Severity: normal
          Priority: P3
         Component: libgcc
          Assignee: unassigned at gcc dot gnu.org
          Reporter: kirdyankinsp at gmail dot com
  Target Milestone: ---

To cast float to uint64 the initial value first converted to double. Cortex-M4
does not have hardware support for "double". Therefore, such an implementation
adds several kilobytes of code, which is unacceptable for embeded systems with
limited resources. In addition, increasing the code size is, of course, low
performance.
I think it is necessary to separate the softfloat functions for platforms that
have hardware "float" support and do not have a hardware "double". The current
version cannot be used for embedded systems. Additional functions need to be
written.
Below is the __aeabi_f2ulz code for fpv4-sp-d16 (ieee 754):

uint64_t __aeabi_f2ulz(float f)
{
  uint32_t result = *((uint32_t*) &f);
  uint32_t exp;
  result &= (~0x80000000);
  exp = result >> 23;
  result &= 0x7FFFFF;
  result |= 0x800000;
#if CHECK_UB
  if (exp == 0xFF) // if NaN or inf
  {
    return 0x8000000000000000; // ????????
  }
  if(exp > (0x96 + 40)) // if the variable value is too large
  {
    return 0x8000000000000000; // ????????
  }
#endif
  if (exp < 0x7F)
  {
    return 0;
  }
  if (exp <= 0x96U)
  {
    exp = 0x96 - exp;
    result >>= exp;
    return (uint64_t) result;
  }
  exp -= 0x96U;
  return (uint64_t) result << exp;
}

^ permalink raw reply	[flat|nested] 8+ messages in thread

* [Bug target/113155] large overhead for cast float to uint64_t. Arm cortex-m4 (ARMv7E-M, fpv4-sp-d16, ieee 754). Compiler: arm-none-eabi-gcc
  2023-12-27  7:03 [Bug libgcc/113155] New: large overhead for cast float to uint64_t. Arm cortex-m4 (ARMv7E-M, fpv4-sp-d16, ieee 754). Compiler: arm-none-eabi-gcc kirdyankinsp at gmail dot com
@ 2023-12-27  7:17 ` pinskia at gcc dot gnu.org
  2023-12-27  7:23 ` pinskia at gcc dot gnu.org
                   ` (5 subsequent siblings)
  6 siblings, 0 replies; 8+ messages in thread
From: pinskia at gcc dot gnu.org @ 2023-12-27  7:17 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113155

Andrew Pinski <pinskia at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
           Severity|normal                      |enhancement
           Keywords|                            |missed-optimization
          Component|libgcc                      |target

^ permalink raw reply	[flat|nested] 8+ messages in thread

* [Bug target/113155] large overhead for cast float to uint64_t. Arm cortex-m4 (ARMv7E-M, fpv4-sp-d16, ieee 754). Compiler: arm-none-eabi-gcc
  2023-12-27  7:03 [Bug libgcc/113155] New: large overhead for cast float to uint64_t. Arm cortex-m4 (ARMv7E-M, fpv4-sp-d16, ieee 754). Compiler: arm-none-eabi-gcc kirdyankinsp at gmail dot com
  2023-12-27  7:17 ` [Bug target/113155] " pinskia at gcc dot gnu.org
@ 2023-12-27  7:23 ` pinskia at gcc dot gnu.org
  2023-12-27  7:24 ` pinskia at gcc dot gnu.org
                   ` (4 subsequent siblings)
  6 siblings, 0 replies; 8+ messages in thread
From: pinskia at gcc dot gnu.org @ 2023-12-27  7:23 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113155

--- Comment #1 from Andrew Pinski <pinskia at gcc dot gnu.org> ---
I looked into the sources of libgcc, __aeabi_f2ulz is defined as __fixunssfdi
which is defined in soft-fp/fixunssfdi.c and that does:
```
UDItype
__fixunssfdi (SFtype a)
{
  FP_DECL_EX;
  FP_DECL_S (A);
  UDItype r;

  FP_INIT_EXCEPTIONS;
  FP_UNPACK_RAW_S (A, a);
  FP_TO_INT_S (r, A, DI_BITS, 0);
  FP_HANDLE_EXCEPTIONS;

  return r;
}
```

FP_TO_INT_S is defined to _FP_TO_INT which basically does what you want except
it can handle FP exceptions.

^ permalink raw reply	[flat|nested] 8+ messages in thread

* [Bug target/113155] large overhead for cast float to uint64_t. Arm cortex-m4 (ARMv7E-M, fpv4-sp-d16, ieee 754). Compiler: arm-none-eabi-gcc
  2023-12-27  7:03 [Bug libgcc/113155] New: large overhead for cast float to uint64_t. Arm cortex-m4 (ARMv7E-M, fpv4-sp-d16, ieee 754). Compiler: arm-none-eabi-gcc kirdyankinsp at gmail dot com
  2023-12-27  7:17 ` [Bug target/113155] " pinskia at gcc dot gnu.org
  2023-12-27  7:23 ` pinskia at gcc dot gnu.org
@ 2023-12-27  7:24 ` pinskia at gcc dot gnu.org
  2023-12-27  7:34 ` pinskia at gcc dot gnu.org
                   ` (3 subsequent siblings)
  6 siblings, 0 replies; 8+ messages in thread
From: pinskia at gcc dot gnu.org @ 2023-12-27  7:24 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113155

Andrew Pinski <pinskia at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|UNCONFIRMED                 |WAITING
     Ever confirmed|0                           |1
   Last reconfirmed|                            |2023-12-27

--- Comment #2 from Andrew Pinski <pinskia at gcc dot gnu.org> ---
>The current version cannot be used for embedded systems.

I am thinking you don't have the multilibs set up correctly.

How did you configure GCC here?

^ permalink raw reply	[flat|nested] 8+ messages in thread

* [Bug target/113155] large overhead for cast float to uint64_t. Arm cortex-m4 (ARMv7E-M, fpv4-sp-d16, ieee 754). Compiler: arm-none-eabi-gcc
  2023-12-27  7:03 [Bug libgcc/113155] New: large overhead for cast float to uint64_t. Arm cortex-m4 (ARMv7E-M, fpv4-sp-d16, ieee 754). Compiler: arm-none-eabi-gcc kirdyankinsp at gmail dot com
                   ` (2 preceding siblings ...)
  2023-12-27  7:24 ` pinskia at gcc dot gnu.org
@ 2023-12-27  7:34 ` pinskia at gcc dot gnu.org
  2023-12-28  5:07 ` kirdyankinsp at gmail dot com
                   ` (2 subsequent siblings)
  6 siblings, 0 replies; 8+ messages in thread
From: pinskia at gcc dot gnu.org @ 2023-12-27  7:34 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113155

--- Comment #3 from Andrew Pinski <pinskia at gcc dot gnu.org> ---
Can you provide the following:
* How you configured/built GCC?
* What command line options that you pass to GCC for building your application?

I am suspecting you are not using the correct options to get the  __aeabi_f2ulz
that does not convert to double first.

^ permalink raw reply	[flat|nested] 8+ messages in thread

* [Bug target/113155] large overhead for cast float to uint64_t. Arm cortex-m4 (ARMv7E-M, fpv4-sp-d16, ieee 754). Compiler: arm-none-eabi-gcc
  2023-12-27  7:03 [Bug libgcc/113155] New: large overhead for cast float to uint64_t. Arm cortex-m4 (ARMv7E-M, fpv4-sp-d16, ieee 754). Compiler: arm-none-eabi-gcc kirdyankinsp at gmail dot com
                   ` (3 preceding siblings ...)
  2023-12-27  7:34 ` pinskia at gcc dot gnu.org
@ 2023-12-28  5:07 ` kirdyankinsp at gmail dot com
  2023-12-28  6:22 ` pinskia at gcc dot gnu.org
  2023-12-28  6:48 ` kirdyankinsp at gmail dot com
  6 siblings, 0 replies; 8+ messages in thread
From: kirdyankinsp at gmail dot com @ 2023-12-28  5:07 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113155

--- Comment #4 from Sergey Kirdyankin <kirdyankinsp at gmail dot com> ---
Compiler command line (platform configure):
arm-none-eabi-gcc -march=armv7e-m -mcpu=cortex-m4 -mfloat-abi=hard
-mfpu=fpv4-sp-d16 -mthumb ...

Disassembler:
080016c0 <__aeabi_f2ulz>:
 80016c0:       b5d0            push    {r4, r6, r7, lr}
 80016c2:       f7ff ff3f       bl      8001544 <__aeabi_f2d> !!!!!!!!!!!!
 80016c6:       4b0c            ldr     r3, [pc, #48]   @ (80016f8
<__aeabi_f2ulz+0x38>)
 80016c8:       2200            movs    r2, #0
 80016ca:       4606            mov     r6, r0
 80016cc:       460f            mov     r7, r1
 80016ce:       f7ff fcab       bl      8001028 <__aeabi_dmul>
 80016d2:       f7ff ff8f       bl      80015f4 <__aeabi_d2uiz>
 80016d6:       4604            mov     r4, r0
 80016d8:       f7ff ff12       bl      8001500 <__aeabi_ui2d>
 80016dc:       4b07            ldr     r3, [pc, #28]   @ (80016fc
<__aeabi_f2ulz+0x3c>)
 80016de:       2200            movs    r2, #0
 80016e0:       f7ff fca2       bl      8001028 <__aeabi_dmul>
 80016e4:       4602            mov     r2, r0
 80016e6:       460b            mov     r3, r1
 80016e8:       4630            mov     r0, r6
 80016ea:       4639            mov     r1, r7
 80016ec:       f7ff fdca       bl      8001284 <__aeabi_dsub>
 80016f0:       f7ff ff80       bl      80015f4 <__aeabi_d2uiz>
 80016f4:       4621            mov     r1, r4
 80016f6:       bdd0            pop     {r4, r6, r7, pc}
 80016f8:       3df00000        .word   0x3df00000
 80016fc:       41f00000        .word   0x41f00000

I think this is a libgcc problem. Newlib has nothing to do with it

^ permalink raw reply	[flat|nested] 8+ messages in thread

* [Bug target/113155] large overhead for cast float to uint64_t. Arm cortex-m4 (ARMv7E-M, fpv4-sp-d16, ieee 754). Compiler: arm-none-eabi-gcc
  2023-12-27  7:03 [Bug libgcc/113155] New: large overhead for cast float to uint64_t. Arm cortex-m4 (ARMv7E-M, fpv4-sp-d16, ieee 754). Compiler: arm-none-eabi-gcc kirdyankinsp at gmail dot com
                   ` (4 preceding siblings ...)
  2023-12-28  5:07 ` kirdyankinsp at gmail dot com
@ 2023-12-28  6:22 ` pinskia at gcc dot gnu.org
  2023-12-28  6:48 ` kirdyankinsp at gmail dot com
  6 siblings, 0 replies; 8+ messages in thread
From: pinskia at gcc dot gnu.org @ 2023-12-28  6:22 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113155

--- Comment #5 from Andrew Pinski <pinskia at gcc dot gnu.org> ---
Again how did you configure gcc because that changes which libgcc is being used
and with what options?


Also I suspect you should just be using -march=armv7e-m+fp instead.

Provide the output of:
`arm-none-eabi-gcc -v` should be enough.

^ permalink raw reply	[flat|nested] 8+ messages in thread

* [Bug target/113155] large overhead for cast float to uint64_t. Arm cortex-m4 (ARMv7E-M, fpv4-sp-d16, ieee 754). Compiler: arm-none-eabi-gcc
  2023-12-27  7:03 [Bug libgcc/113155] New: large overhead for cast float to uint64_t. Arm cortex-m4 (ARMv7E-M, fpv4-sp-d16, ieee 754). Compiler: arm-none-eabi-gcc kirdyankinsp at gmail dot com
                   ` (5 preceding siblings ...)
  2023-12-28  6:22 ` pinskia at gcc dot gnu.org
@ 2023-12-28  6:48 ` kirdyankinsp at gmail dot com
  6 siblings, 0 replies; 8+ messages in thread
From: kirdyankinsp at gmail dot com @ 2023-12-28  6:48 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113155

--- Comment #6 from Sergey Kirdyankin <kirdyankinsp at gmail dot com> ---
bin>arm-none-eabi-gcc -v
Using built-in specs.
COLLECT_GCC=arm-none-eabi-gcc
COLLECT_LTO_WRAPPER=C:/work/distributiv/arm-gnu-toolchain-13.2/bin/../libexec/gcc/arm-none-eabi/13.2.1/lto-wrapper.exe
Target: arm-none-eabi
Configured with: /data/jenkins/workspace/GNU-toolchain/arm-13/src/gcc/configure
--target=arm-none-eabi
--prefix=/data/jenkins/workspace/GNU-toolchain/arm-13/build-mingw-arm-none-eabi/install
--with-gmp=/data/jenkins/workspace/GNU-toolchain/arm-13/build-mingw-arm-none-eabi/host-tools
--with-mpfr=/data/jenkins/workspace/GNU-toolchain/arm-13/build-mingw-arm-none-eabi/host-tools
--with-mpc=/data/jenkins/workspace/GNU-toolchain/arm-13/build-mingw-arm-none-eabi/host-tools
--with-isl=/data/jenkins/workspace/GNU-toolchain/arm-13/build-mingw-arm-none-eabi/host-tools
--disable-shared --disable-nls --disable-threads --disable-tls
--enable-checking=release --enable-languages=c,c++,fortran --with-newlib
--with-gnu-as --with-headers=yes --with-gnu-ld
--with-native-system-header-dir=/include
--with-sysroot=/data/jenkins/workspace/GNU-toolchain/arm-13/build-mingw-arm-none-eabi/install/arm-none-eabi
--with-multilib-list=aprofile,rmprofile
--with-libiconv-prefix=/data/jenkins/workspace/GNU-toolchain/arm-13/build-mingw-arm-none-eabi/host-tools
--host=i686-w64-mingw32 --with-pkgversion='Arm GNU Toolchain 13.2.rel1 (Build
arm-13.7)' --with-bugurl=https://bugs.linaro.org/
Thread model: single
Supported LTO compression algorithms: zlib
gcc version 13.2.1 20231009 (Arm GNU Toolchain 13.2.rel1 (Build arm-13.7))

^ permalink raw reply	[flat|nested] 8+ messages in thread

end of thread, other threads:[~2023-12-28  6:48 UTC | newest]

Thread overview: 8+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2023-12-27  7:03 [Bug libgcc/113155] New: large overhead for cast float to uint64_t. Arm cortex-m4 (ARMv7E-M, fpv4-sp-d16, ieee 754). Compiler: arm-none-eabi-gcc kirdyankinsp at gmail dot com
2023-12-27  7:17 ` [Bug target/113155] " pinskia at gcc dot gnu.org
2023-12-27  7:23 ` pinskia at gcc dot gnu.org
2023-12-27  7:24 ` pinskia at gcc dot gnu.org
2023-12-27  7:34 ` pinskia at gcc dot gnu.org
2023-12-28  5:07 ` kirdyankinsp at gmail dot com
2023-12-28  6:22 ` pinskia at gcc dot gnu.org
2023-12-28  6:48 ` kirdyankinsp at gmail dot com

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).