An asm constraint issue (ARM FPU)

public inbox for gcc@gcc.gnu.org
 help / color / mirror / Atom feed

* An asm  constraint issue (ARM FPU)
@ 2021-07-25  2:36 Zoltán Kócsi
  2021-07-25 12:19 ` Marc Glisse
  0 siblings, 1 reply; 3+ messages in thread
From: Zoltán Kócsi @ 2021-07-25  2:36 UTC (permalink / raw)
  To: gcc

I try to write a one-liner inline function to create a double form
a 64-bit integer, not converting it to a double but the integer
containing the bit pattern for the double (type spoofing).

The compiler is arm-eabi-gcc 8.2.0.
The target is a Cortex-A9, with NEON.

According to the info page the assembler constraint "w" denotes an FPU
double register, d0 - d31.

The code is the following:

double spoof( uint64_t x )
{
double r;

   asm volatile
   (
     " vmov.64 %[d],%Q[i],%R[i] \n"
     : [d] "=w" (r)
     : [i] "q" (x)
   );

   return r;
}

The command line:

arm-eabi-gcc -O0 -c -mcpu=cortex-a9 -mfloat-abi=hard -mfpu=neon-vfpv4 \
test.c

It compiles and the generated object code is this:

00000000 <spoof>:
   0:   e52db004        push    {fp}            ; (str fp, [sp, #-4]!)
   4:   e28db000        add     fp, sp, #0
   8:   e24dd014        sub     sp, sp, #20
   c:   e14b01f4        strd    r0, [fp, #-20]  ; 0xffffffec
  10:   e14b21d4        ldrd    r2, [fp, #-20]  ; 0xffffffec
  14:   ec432b30        vmov    d16, r2, r3
  18:   ed4b0b03        vstr    d16, [fp, #-12]
  1c:   e14b20dc        ldrd    r2, [fp, #-12]
  20:   ec432b30        vmov    d16, r2, r3
  24:   eeb00b60        vmov.f64        d0, d16
  28:   e28bd000        add     sp, fp, #0
  2c:   e49db004        pop     {fp}            ; (ldr fp, [sp], #4)
  30:   e12fff1e        bx      lr

which is not really efficient, but works.

However, if I specify -O1, -O2 or -Os then the compilation fails
because assembler complains. This is the assembly the compiler
generated, (comments and irrelevant stuff removed):

spoof:
   vmov.64 s0,r0,r1
   bx lr

where the problem is that 's0' is a single-precision float register and
it should be 'd0' instead.

Either I'm seriously missing something, in which case I would be most
obliged if someone sent me to the right direction; or it is a compiler
or documentation bug.

Thanks,

Zoltan

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: An asm  constraint issue (ARM FPU)
  2021-07-25  2:36 An asm constraint issue (ARM FPU) Zoltán Kócsi
@ 2021-07-25 12:19 ` Marc Glisse
  2021-07-29 21:26   ` Zoltán Kócsi
  0 siblings, 1 reply; 3+ messages in thread
From: Marc Glisse @ 2021-07-25 12:19 UTC (permalink / raw)
  To: Zoltán Kócsi; +Cc: gcc

On Sun, 25 Jul 2021, Zoltán Kócsi wrote:

> I try to write a one-liner inline function to create a double form
> a 64-bit integer, not converting it to a double but the integer
> containing the bit pattern for the double (type spoofing).
>
> The compiler is arm-eabi-gcc 8.2.0.
> The target is a Cortex-A9, with NEON.
>
> According to the info page the assembler constraint "w" denotes an FPU
> double register, d0 - d31.
>
> The code is the following:
>
> double spoof( uint64_t x )
> {
> double r;
>
>   asm volatile
>   (
>     " vmov.64 %[d],%Q[i],%R[i] \n"

Isn't it supposed to be %P[d] for a double?
(the documentation is very lacking...)

>     : [d] "=w" (r)
>     : [i] "q" (x)
>   );
>
>   return r;
> }
>
> The command line:
>
> arm-eabi-gcc -O0 -c -mcpu=cortex-a9 -mfloat-abi=hard -mfpu=neon-vfpv4 \
> test.c
>
> It compiles and the generated object code is this:
>
> 00000000 <spoof>:
>   0:   e52db004        push    {fp}            ; (str fp, [sp, #-4]!)
>   4:   e28db000        add     fp, sp, #0
>   8:   e24dd014        sub     sp, sp, #20
>   c:   e14b01f4        strd    r0, [fp, #-20]  ; 0xffffffec
>  10:   e14b21d4        ldrd    r2, [fp, #-20]  ; 0xffffffec
>  14:   ec432b30        vmov    d16, r2, r3
>  18:   ed4b0b03        vstr    d16, [fp, #-12]
>  1c:   e14b20dc        ldrd    r2, [fp, #-12]
>  20:   ec432b30        vmov    d16, r2, r3
>  24:   eeb00b60        vmov.f64        d0, d16
>  28:   e28bd000        add     sp, fp, #0
>  2c:   e49db004        pop     {fp}            ; (ldr fp, [sp], #4)
>  30:   e12fff1e        bx      lr
>
> which is not really efficient, but works.
>
> However, if I specify -O1, -O2 or -Os then the compilation fails
> because assembler complains. This is the assembly the compiler
> generated, (comments and irrelevant stuff removed):
>
> spoof:
>   vmov.64 s0,r0,r1
>   bx lr
>
> where the problem is that 's0' is a single-precision float register and
> it should be 'd0' instead.
>
> Either I'm seriously missing something, in which case I would be most
> obliged if someone sent me to the right direction; or it is a compiler
> or documentation bug.
>
> Thanks,
>
> Zoltan

-- 
Marc Glisse

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: An asm  constraint issue (ARM FPU)
  2021-07-25 12:19 ` Marc Glisse
@ 2021-07-29 21:26   ` Zoltán Kócsi
  0 siblings, 0 replies; 3+ messages in thread
From: Zoltán Kócsi @ 2021-07-29 21:26 UTC (permalink / raw)
  To: gcc

Dear Marc,

Sorry for the late answer, I was away for a few days.
Yes, that fixes it. THANK YOU!

Do you know which gcc source file contains the magic qualifiers for the
asm arguments? I wouldn't mind to go through the code and extract what
I can. Probably I'd find a couple of gems that are useful for inline
asm stuff. Maybe even write the info pages that describe them, so
that others can make use of them...

Thanks again,

Best Regards,

Zoltan

On Sun, 25 Jul 2021 14:19:56 +0200 (CEST)
Marc Glisse <marc.glisse@inria.fr> wrote:

> On Sun, 25 Jul 2021, Zoltán Kócsi wrote:
> 
> > [...]
> > double spoof( uint64_t x )
> > {
> > double r;
> >
> >   asm volatile
> >   (
> >     " vmov.64 %[d],%Q[i],%R[i] \n"  
> 
> Isn't it supposed to be %P[d] for a double?
> (the documentation is very lacking...)
> 
> > [...]
> -- 
> Marc Glisse

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2021-07-29 21:26 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2021-07-25  2:36 An asm constraint issue (ARM FPU) Zoltán Kócsi
2021-07-25 12:19 ` Marc Glisse
2021-07-29 21:26   ` Zoltán Kócsi

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).