public inbox for gcc-bugs@sourceware.org
help / color / mirror / Atom feed
* [Bug c/54802] New: Trivial code changes result in different assembly with respect to rotations and bswap.
@ 2012-10-03 23:18 jasongross9+bugzilla at gmail dot com
  2021-07-26 21:02 ` [Bug middle-end/54802] " pinskia at gcc dot gnu.org
                   ` (2 more replies)
  0 siblings, 3 replies; 4+ messages in thread
From: jasongross9+bugzilla at gmail dot com @ 2012-10-03 23:18 UTC (permalink / raw)
  To: gcc-bugs


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=54802

             Bug #: 54802
           Summary: Trivial code changes result in different assembly with
                    respect to rotations and bswap.
    Classification: Unclassified
           Product: gcc
           Version: 4.8.0
            Status: UNCONFIRMED
          Severity: enhancement
          Priority: P3
         Component: c
        AssignedTo: unassigned@gcc.gnu.org
        ReportedBy: jasongross9+bugzilla@gmail.com


Created attachment 28347
  --> http://gcc.gnu.org/bugzilla/attachment.cgi?id=28347
Code files

In some C code, manually inlining constants changes whether or not gcc compiles
things to rotations or to bswaps.  In particular, the following code

uint64_t reverse0(uint64_t v) {
  v = ((v >> 1) & 0x5555555555555555ULL) | ((v & 0x5555555555555555ULL) << 1);
  v = ((v >> 2) & 0x3333333333333333ULL) | ((v & 0x3333333333333333ULL) << 2);
  v = ((v >> 4) & 0x0F0F0F0F0F0F0F0FULL) | ((v & 0x0F0F0F0F0F0F0F0FULL) << 4);
  v = ((v >> 8) & 0x00FF00FF00FF00FFULL) | ((v & 0x00FF00FF00FF00FFULL) << 8);
  v = ((v >> 16) & 0x0000FFFF0000FFFFULL) | ((v & 0x0000FFFF0000FFFFULL) <<
16);
  const uint64_t
      va = ((v >> 32) & 0x00000000FFFFFFFFULL),
      vb = ((v & 0x00000000FFFFFFFFULL) << 32);
  v = va | vb;
  return v;
}

uint64_t reverse1(uint64_t v) {
  v = ((v >> 1) & 0x5555555555555555ULL) | ((v & 0x5555555555555555ULL) << 1);
  v = ((v >> 2) & 0x3333333333333333ULL) | ((v & 0x3333333333333333ULL) << 2);
  v = ((v >> 4) & 0x0F0F0F0F0F0F0F0FULL) | ((v & 0x0F0F0F0F0F0F0F0FULL) << 4);
  v = ((v >> 8) & 0x00FF00FF00FF00FFULL) | ((v & 0x00FF00FF00FF00FFULL) << 8);
  v = ((v >> 16) & 0x0000FFFF0000FFFFULL) | ((v & 0x0000FFFF0000FFFFULL) <<
16);
  v = ((v >> 32) & 0x00000000FFFFFFFFULL) | ((v & 0x00000000FFFFFFFFULL) <<
32);
  return v;
}

compiles to 

reverse0:
.LFB8:
    .cfi_startproc
    movq    %rdi, %rdx
    movabsq    $6148914691236517205, %rax
    movabsq    $3689348814741910323, %rcx
    shrq    %rdx
    andq    %rax, %rdx
    andq    %rdi, %rax
    addq    %rax, %rax
    orq    %rdx, %rax
    movq    %rax, %rdx
    andq    %rcx, %rax
    shrq    $2, %rdx
    salq    $2, %rax
    andq    %rcx, %rdx
    movabsq    $1085102592571150095, %rcx
    orq    %rdx, %rax
    movq    %rax, %rdx
    andq    %rcx, %rax
    shrq    $4, %rdx
    salq    $4, %rax
    andq    %rcx, %rdx
    orq    %rdx, %rax
    bswap    %rax
    ret
    .cfi_endproc
.LFE8:
    .size    reverse0, .-reverse0
    .p2align 4,,15
    .globl    reverse1
    .type    reverse1, @function
reverse1:
.LFB9:
    .cfi_startproc
    movq    %rdi, %rdx
    movabsq    $6148914691236517205, %rax
    movabsq    $3689348814741910323, %rcx
    shrq    %rdx
    andq    %rax, %rdx
    andq    %rdi, %rax
    addq    %rax, %rax
    orq    %rdx, %rax
    movq    %rax, %rdx
    andq    %rcx, %rax
    shrq    $2, %rdx
    salq    $2, %rax
    andq    %rcx, %rdx
    movabsq    $1085102592571150095, %rcx
    orq    %rdx, %rax
    movq    %rax, %rdx
    andq    %rcx, %rax
    shrq    $4, %rdx
    salq    $4, %rax
    andq    %rcx, %rdx
    movabsq    $71777214294589695, %rcx
    orq    %rdx, %rax
    movq    %rax, %rdx
    andq    %rcx, %rax
    shrq    $8, %rdx
    salq    $8, %rax
    andq    %rcx, %rdx
    movabsq    $281470681808895, %rcx
    orq    %rdx, %rax
    movq    %rax, %rdx
    andq    %rcx, %rax
    shrq    $16, %rdx
    salq    $16, %rax
    andq    %rcx, %rdx
    orq    %rdx, %rax
    rorq    $32, %rax
    ret
    .cfi_endproc
.LFE9:
    .size    reverse1, .-reverse1
    .p2align 4,,15
    .globl    reverse2
    .type    reverse2, @function


In the code that I'm using this in, reverse0 is 30% faster than reverse1.  I
don't think that manual constant inlining, when each constant is used exactly
once, should change the assembly code that gcc compiles to.

The relevant (.c, .i, .s, and a log of the command line) files are attached.


^ permalink raw reply	[flat|nested] 4+ messages in thread

* [Bug middle-end/54802] Trivial code changes result in different assembly with respect to rotations and bswap.
  2012-10-03 23:18 [Bug c/54802] New: Trivial code changes result in different assembly with respect to rotations and bswap jasongross9+bugzilla at gmail dot com
@ 2021-07-26 21:02 ` pinskia at gcc dot gnu.org
  2021-12-15 22:39 ` pinskia at gcc dot gnu.org
  2022-02-01 12:43 ` marxin at gcc dot gnu.org
  2 siblings, 0 replies; 4+ messages in thread
From: pinskia at gcc dot gnu.org @ 2021-07-26 21:02 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=54802

--- Comment #1 from Andrew Pinski <pinskia at gcc dot gnu.org> ---
In GCC 5+, we are able to figure out both do bswap.

^ permalink raw reply	[flat|nested] 4+ messages in thread

* [Bug middle-end/54802] Trivial code changes result in different assembly with respect to rotations and bswap.
  2012-10-03 23:18 [Bug c/54802] New: Trivial code changes result in different assembly with respect to rotations and bswap jasongross9+bugzilla at gmail dot com
  2021-07-26 21:02 ` [Bug middle-end/54802] " pinskia at gcc dot gnu.org
@ 2021-12-15 22:39 ` pinskia at gcc dot gnu.org
  2022-02-01 12:43 ` marxin at gcc dot gnu.org
  2 siblings, 0 replies; 4+ messages in thread
From: pinskia at gcc dot gnu.org @ 2021-12-15 22:39 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=54802

Andrew Pinski <pinskia at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
      Known to fail|                            |4.9.4
     Ever confirmed|0                           |1
   Last reconfirmed|                            |2021-12-15
           Keywords|                            |needs-bisection
      Known to work|                            |5.1.0
             Status|UNCONFIRMED                 |NEW

^ permalink raw reply	[flat|nested] 4+ messages in thread

* [Bug middle-end/54802] Trivial code changes result in different assembly with respect to rotations and bswap.
  2012-10-03 23:18 [Bug c/54802] New: Trivial code changes result in different assembly with respect to rotations and bswap jasongross9+bugzilla at gmail dot com
  2021-07-26 21:02 ` [Bug middle-end/54802] " pinskia at gcc dot gnu.org
  2021-12-15 22:39 ` pinskia at gcc dot gnu.org
@ 2022-02-01 12:43 ` marxin at gcc dot gnu.org
  2 siblings, 0 replies; 4+ messages in thread
From: marxin at gcc dot gnu.org @ 2022-02-01 12:43 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=54802

Martin Liška <marxin at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
         Resolution|---                         |FIXED
             Status|NEW                         |RESOLVED
                 CC|                            |marxin at gcc dot gnu.org

--- Comment #2 from Martin Liška <marxin at gcc dot gnu.org> ---
Fixed with r5-4562-gc6e3a931201a2ce4.

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2022-02-01 12:43 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2012-10-03 23:18 [Bug c/54802] New: Trivial code changes result in different assembly with respect to rotations and bswap jasongross9+bugzilla at gmail dot com
2021-07-26 21:02 ` [Bug middle-end/54802] " pinskia at gcc dot gnu.org
2021-12-15 22:39 ` pinskia at gcc dot gnu.org
2022-02-01 12:43 ` marxin at gcc dot gnu.org

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).