public inbox for gcc-bugs@sourceware.org
help / color / mirror / Atom feed
* [Bug tree-optimization/40771] generated code is ~25% slower when autovectorization is enabled
       [not found] <bug-40771-4@http.gcc.gnu.org/bugzilla/>
@ 2020-09-06 10:47 ` zsojka at seznam dot cz
  2024-04-03 23:36 ` [Bug target/40771] " pinskia at gcc dot gnu.org
  1 sibling, 0 replies; 2+ messages in thread
From: zsojka at seznam dot cz @ 2020-09-06 10:47 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=40771

--- Comment #3 from Zdenek Sojka <zsojka at seznam dot cz> ---
The vectorized code seems to have improved in gcc-9 over gcc-8.

^ permalink raw reply	[flat|nested] 2+ messages in thread

* [Bug target/40771] generated code is ~25% slower when autovectorization is enabled
       [not found] <bug-40771-4@http.gcc.gnu.org/bugzilla/>
  2020-09-06 10:47 ` [Bug tree-optimization/40771] generated code is ~25% slower when autovectorization is enabled zsojka at seznam dot cz
@ 2024-04-03 23:36 ` pinskia at gcc dot gnu.org
  1 sibling, 0 replies; 2+ messages in thread
From: pinskia at gcc dot gnu.org @ 2024-04-03 23:36 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=40771

--- Comment #4 from Andrew Pinski <pinskia at gcc dot gnu.org> ---
AARCH64 vectorization looks decent too:
```
        dup     v31.8h, w0
        adrp    x2, .LC0
        adrp    x0, .LC1
        adrp    x1, .LANCHOR0
        ldr     q30, [x2, #:lo12:.LC0]
        ldr     q29, [x0, #:lo12:.LC1]
        add     v30.8h, v31.8h, v30.8h
        add     v29.8h, v31.8h, v29.8h
        uzp2    v29.16b, v30.16b, v29.16b
        str     q29, [x1, #:lo12:.LANCHOR0]
```

The only improvement that can be made there is with SVE, those ldr could be
`index` instructions instead but that is PR 113328 .

^ permalink raw reply	[flat|nested] 2+ messages in thread

end of thread, other threads:[~2024-04-03 23:36 UTC | newest]

Thread overview: 2+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
     [not found] <bug-40771-4@http.gcc.gnu.org/bugzilla/>
2020-09-06 10:47 ` [Bug tree-optimization/40771] generated code is ~25% slower when autovectorization is enabled zsojka at seznam dot cz
2024-04-03 23:36 ` [Bug target/40771] " pinskia at gcc dot gnu.org

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).