public inbox for gcc-bugs@sourceware.org
help / color / mirror / Atom feed
* [Bug target/94687] New: PPC vector fails to optimize shift (used bits)
@ 2020-04-21 10:32 shawn at git dot icu
  2020-04-30 17:16 ` [Bug target/94687] " segher at gcc dot gnu.org
  2020-06-16 11:40 ` shawn at git dot icu
  0 siblings, 2 replies; 3+ messages in thread
From: shawn at git dot icu @ 2020-04-21 10:32 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94687

            Bug ID: 94687
           Summary: PPC vector fails to optimize shift (used bits)
           Product: gcc
           Version: 8.0
            Status: UNCONFIRMED
          Severity: normal
          Priority: P3
         Component: target
          Assignee: unassigned at gcc dot gnu.org
          Reporter: shawn at git dot icu
  Target Milestone: ---

https://godbolt.org/z/ZyTG9b


#include <altivec.h>

typedef vector unsigned __int128 block;
typedef vector unsigned long long vector2_u64;

block swap_with_shift(block num) {
    return num << 64 | num >> 64;
}

block swap_without_shift(block num) {
    vector unsigned long long ret;
    ret[0] = ((vector2_u64)num)[1];
    ret[1] = ((vector2_u64)num)[0];
    return (block)ret;
}

typedef unsigned __int128 u128;

u128 swap_scalar(u128 in) {
    return in << 64 | in >> 64;
}

swap_with_shift:
        xxpermdi 34,34,34,2
        addi 9,1,-16
        stxvd2x 34,0,9
        ld 8,-8(1)
        ld 9,-16(1)
        mtvsrd 1,8
        mtvsrd 0,9
        xxpermdi 34,0,1,0
        blr
        .long 0
        .byte 0,0,0,0,0,0,0,0
swap_without_shift:
        xxpermdi 34,34,34,2
        blr
        .long 0
        .byte 0,0,0,0,0,0,0,0
swap_scalar:
        mr 9,3
        mr 3,4
        mr 4,9
        blr
        .long 0
        .byte 0,0,0,0,0,0,0,0

^ permalink raw reply	[flat|nested] 3+ messages in thread

* [Bug target/94687] PPC vector fails to optimize shift (used bits)
  2020-04-21 10:32 [Bug target/94687] New: PPC vector fails to optimize shift (used bits) shawn at git dot icu
@ 2020-04-30 17:16 ` segher at gcc dot gnu.org
  2020-06-16 11:40 ` shawn at git dot icu
  1 sibling, 0 replies; 3+ messages in thread
From: segher at gcc dot gnu.org @ 2020-04-30 17:16 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94687

Segher Boessenkool <segher at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
   Last reconfirmed|                            |2020-04-30
             Status|UNCONFIRMED                 |NEW
                 CC|                            |segher at gcc dot gnu.org
     Ever confirmed|0                           |1

--- Comment #1 from Segher Boessenkool <segher at gcc dot gnu.org> ---
Confirmed.

At combine time we start with

insn_cost 4 for    25: r130:V1TI=%2:V1TI
      REG_DEAD %2:V1TI
insn_cost 4 for    20: r129:V1TI=r130:V1TI
      REG_DEAD r130:V1TI
insn_cost 4 for    21: r127:DI=r129:V1TI#0
insn_cost 4 for    22: r128:DI=r129:V1TI#8
      REG_DEAD r129:V1TI
insn_cost 4 for    24: r123:TI=0
insn_cost 4 for     7: r123:TI#8=r127:DI
      REG_DEAD r127:DI
insn_cost 4 for     8: r123:TI#0=r128:DI
      REG_DEAD r128:DI
insn_cost 4 for     9: r122:V1TI=r123:TI#0
      REG_DEAD r123:TI
insn_cost 4 for    14: %2:V1TI=r122:V1TI
      REG_DEAD r122:V1TI
insn_cost 0 for    15: use %2:V1TI

and those subregs at the lhs (insns 7 and 8) cannot combine with anything.

2-to-2 combine won't handle 20+21 (and then, 20+22) because 20 is a register
move already.  It would probably combine fine if that subreg lhs problem was
fixed though.

^ permalink raw reply	[flat|nested] 3+ messages in thread

* [Bug target/94687] PPC vector fails to optimize shift (used bits)
  2020-04-21 10:32 [Bug target/94687] New: PPC vector fails to optimize shift (used bits) shawn at git dot icu
  2020-04-30 17:16 ` [Bug target/94687] " segher at gcc dot gnu.org
@ 2020-06-16 11:40 ` shawn at git dot icu
  1 sibling, 0 replies; 3+ messages in thread
From: shawn at git dot icu @ 2020-06-16 11:40 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94687

--- Comment #2 from Shawn Landden <shawn at git dot icu> ---
LLVM fixed this by lowering to vector shuffle: https://dev.gnupg.org/D501

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2020-06-16 11:40 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2020-04-21 10:32 [Bug target/94687] New: PPC vector fails to optimize shift (used bits) shawn at git dot icu
2020-04-30 17:16 ` [Bug target/94687] " segher at gcc dot gnu.org
2020-06-16 11:40 ` shawn at git dot icu

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).