public inbox for gcc-cvs@sourceware.org help / color / mirror / Atom feed
From: Michael Meissner <meissner@gcc.gnu.org> To: gcc-cvs@gcc.gnu.org Subject: [gcc(refs/users/meissner/heads/work146-vpair)] Update ChangeLog.* Date: Fri, 17 Nov 2023 20:47:48 +0000 (GMT) [thread overview] Message-ID: <20231117204748.808F83858D28@sourceware.org> (raw) https://gcc.gnu.org/g:eb78fd5bd87b224e0bba0dd9e08b16f78387543f commit eb78fd5bd87b224e0bba0dd9e08b16f78387543f Author: Michael Meissner <meissner@linux.ibm.com> Date: Fri Nov 17 15:47:45 2023 -0500 Update ChangeLog.* Diff: --- gcc/ChangeLog.vpair | 350 ++++++++++++++++++++++++++++++++++++++++++++++++++++ 1 file changed, 350 insertions(+) diff --git a/gcc/ChangeLog.vpair b/gcc/ChangeLog.vpair index 59844228c32..e7cbdc06bc4 100644 --- a/gcc/ChangeLog.vpair +++ b/gcc/ChangeLog.vpair @@ -1,5 +1,355 @@ +==================== Branch work146-vpair, patch #205 ==================== + +Add overloads for __builtin_vpair_assemble. + +2023-11-17 Michael Meissner <meissner@linux.ibm.com> + +gcc/ + + * config/rs6000/rs6000-overloads.def (__builtin_vpair_assemble): Add + overloads. + +==================== Branch work146-vpair, patch #204 ==================== + +Add support for doing a horizontal add on vector pair elements. + +This patch adds a series of built-in functions to allow users to write code to +do a number of simple operations where the loop is done using the __vector_pair +type. The __vector_pair type is an opaque type. These built-in functions keep +the two 128-bit vectors within the __vector_pair together, and split the +operation after register allocation. + +This patch provides vector pair built-in functions to do a horizontal add on +vector pair elements. Only floating point and 64-bit horizontal adds are +provided in this patch. + +I have built and tested these patches on: + + * A little endian power10 server using --with-cpu=power10 + * A little endian power9 server using --with-cpu=power9 + * A big endian power9 server using --with-cpu=power9. + +Can I check this patch into the master branch after the preceeding patches have +been checked in? + +2023-11-17 Michael Meissner <meissner@linux.ibm.com> + +gcc/ + + * config/rs6000/rs6000-builtins.def (__builtin_vpair_f32_add_elements): + New built-in function. + (__builtin_vpair_f64_add_elements): Likewise. + (__builtin_vpair_i64_add_elements): Likewise. + (__builtin_vpair_i64u_add_elements): Likewise. + * config/rs6000/vector-pair.md (UNSPEC_VPAIR_REDUCE_PLUS_F32): New + unspec. + (UNSPEC_VPAIR_REDUCE_PLUS_F64): Likewise. + (UNSPEC_VPAIR_REDUCE_PLUS_I64): Likewise. + (vpair_reduc_plus_scale_v8sf): New insn. + (vpair_reduc_plus_scale_v4df): Likewise. + (vpair_reduc_plus_scale_v4di): Likewise. + * doc/extend.texi (__builtin_vpair_f32_add_elements): Document. + (__builtin_vpair_f64_add_elements): Likewise. + (__builtin_vpair_i64_add_elements): Likewise. + +gcc/testsuite/ + + * gcc.target/powerpc/vector-pair-16.c: New test. + +==================== Branch work146-vpair, patch #203 ==================== + +Add support for initializing and extracting from vector pairs. + +This patch adds a series of built-in functions to allow users to write code to +do a number of simple operations where the loop is done using the __vector_pair +type. The __vector_pair type is an opaque type. These built-in functions keep +the two 128-bit vectors within the __vector_pair together, and split the +operation after register allocation. + +This patch provides vector pair operations for loading up a vector pair with all +0's, duplicated (splat) from a scalar type, or combining two vectors in a vector +pair. This patch also provides vector pair builtins to extract one vector +element of a vector pair. + +I have built and tested these patches on: + + * A little endian power10 server using --with-cpu=power10 + * A little endian power9 server using --with-cpu=power9 + * A big endian power9 server using --with-cpu=power9. + +Can I check this patch into the master branch after the preceeding patches have +been checked in? + +2023-11-17 Michael Meissner <meissner@linux.ibm.com> + +gcc/ + + * config/rs6000/predicates.md (mma_assemble_input_operand): Allow any + 16-byte vector, not just V16QImode. + * config/rs6000/rs6000-builtins.def (__builtin_vpair_zero): New vector + pair initialization built-in functions. + (__builtin_vpair_*_assemble): Likeise. + (__builtin_vpair_*_splat): Likeise. + (__builtin_vpair_*_extract_vector): New vector pair extraction built-in + functions. + * config/rs6000/vector-pair.md (UNSPEC_VPAIR_V32QI): New unspec. + (UNSPEC_VPAIR_V16HI): Likewise. + (UNSPEC_VPAIR_V8SI): Likewise. + (UNSPEC_VPAIR_V4DI): Likewise. + (VP_INT_BINARY): New iterator for integer vector pair. + (vp_insn): Add supoort for integer vector pairs. + (vp_ireg): New code attribute for integer vector pairs. + (vp_ipredicate): Likewise. + (VP_INT): New int interator for integer vector pairs. + (VP_VEC_MODE): Likewise. + (vp_pmode): Likewise. + (vp_vmode): Likewise. + (vp_neg_reg): New int interator for integer vector pairs. + (vpair_neg_<vp_pmode>): Add integer vector pair support insns. + (vpair_not_<vp_pmode>2): Likewise. + (vpair_<vp_insn>_<vp_pmode>3): Likewise. + (vpair_andc_<vp_pmode): Likewise. + (vpair_iorc_<vp_pmode>): Likewise. + (vpair_nand_<vp_pmode>_1): Likewise. + (vpair_nand_<vp_pmode>_2): Likewise. + (vpair_nor_<vp_pmode>_1): Likewise. + (vpair_nor_<vp_pmode>_2): Likewise. + * doc/extend.texi (PowerPC Vector Pair Built-in Functions): Document the + integer vector pair built-in functions. + +gcc/testsuite/ + + * gcc.target/powerpc/vector-pair-5.c: New test. + * gcc.target/powerpc/vector-pair-6.c: New test. + * gcc.target/powerpc/vector-pair-7.c: New test. + * gcc.target/powerpc/vector-pair-8.c: New test. + + +==================== Branch work146-vpair, patch #202 ==================== + +Add support for integer point vector pair built-in functions. + +This patch adds a series of built-in functions to allow users to write code to +do a number of simple operations where the loop is done using the __vector_pair +type. The __vector_pair type is an opaque type. These built-in functions keep +the two 128-bit vectors within the __vector_pair together, and split the +operation after register allocation. + +This patch provides vector pair operations for 8, 16, 32, and 64-bit integers. + +I have built and tested these patches on: + + * A little endian power10 server using --with-cpu=power10 + * A little endian power9 server using --with-cpu=power9 + * A big endian power9 server using --with-cpu=power9. + +Can I check this patch into the master branch after the preceeding patch is +checked in? + +2023-11-17 Michael Meissner <meissner@linux.ibm.com> + +gcc/ + + * config/rs6000/rs6000-builtins.def (__builtin_vpair_i8*): Add built-in + functions for integer vector pairs. + (__builtin_vpair_i16*): Likeise. + (__builtin_vpair_i32*): Likeise. + (__builtin_vpair_i64*): Likeise. + * config/rs6000/vector-pair.md (UNSPEC_VPAIR_V32QI): New unspec. + (UNSPEC_VPAIR_V16HI): Likewise. + (UNSPEC_VPAIR_V8SI): Likewise. + (UNSPEC_VPAIR_V4DI): Likewise. + (VP_INT_BINARY): New iterator for integer vector pair. + (vp_insn): Add supoort for integer vector pairs. + (vp_ireg): New code attribute for integer vector pairs. + (vp_ipredicate): Likewise. + (VP_INT): New int interator for integer vector pairs. + (VP_VEC_MODE): Likewise. + (vp_pmode): Likewise. + (vp_vmode): Likewise. + (vp_neg_reg): New int interator for integer vector pairs. + (vpair_neg_<vp_pmode>): Add integer vector pair support insns. + (vpair_not_<vp_pmode>2): Likewise. + (vpair_<vp_insn>_<vp_pmode>3): Likewise. + (vpair_andc_<vp_pmode): Likewise. + (*vpair_iorc_<vp_pmode>): Likewise. + (vpair_nand_<vp_pmode>_1): Likewise. + (vpair_nand_<vp_pmode>_2): Likewise. + (vpair_nor_<vp_pmode>_1): Likewise. + (vpair_nor_<vp_pmode>_2): Likewise. + * doc/extend.texi (PowerPC Vector Pair Built-in Functions): Document the + integer vector pair built-in functions. + +gcc/testsuite/ + + * gcc.target/powerpc/vector-pair-5.c: New test. + * gcc.target/powerpc/vector-pair-6.c: New test. + * gcc.target/powerpc/vector-pair-7.c: New test. + * gcc.target/powerpc/vector-pair-8.c: New test. + + +==================== Branch work146-vpair, patch #201 ==================== + +Add support for floating point vector pair built-in functions. + +This patch adds a series of built-in functions to allow users to write code to +do a number of simple operations where the loop is done using the __vector_pair +type. The __vector_pair type is an opaque type. These built-in functions keep +the two 128-bit vectors within the __vector_pair together, and split the +operation after register allocation. + +This patch provides vector pair operations for 32-bit floating point and 64-bit +floating point. + +2023-11-17 Michael Meissner <meissner@linux.ibm.com> + +gcc/ + + * config/rs6000/rs6000-builtins.def (__builtin_vpair_f32_*): Add vector + pair built-in functions for float. + (__builtin_vpair_f64_*): Add vector pair built-in functions for double. + * config/rs6000/rs6000-protos.h (split_unary_vector_pair): Add + declaration. + (split_binary_vector_pair): Likewise. + (split_fma_vector_pair): Likewise. + * config/rs6000/rs6000.cc (split_unary_vector_pair): New helper function + for vector pair built-in functions. + (split_binary_vector_pair): Likewise. + (split_fma_vector_pair): Likewise. + * config/rs6000/rs6000.md (toplevel): Include vector-pair.md. + * config/rs6000/t-rs6000 (MD_INCLUDES): Add vector-pair.md. + * config/rs6000/vector-pair.md: New file. + * doc/extend.texi (PowerPC Vector Pair Built-in Functions): Document the + floating point and general vector pair built-in functions. + +gcc/testsuite/ + + * gcc.target/powerpc/vector-pair-1.c: New test. + * gcc.target/powerpc/vector-pair-2.c: New test. + * gcc.target/powerpc/vector-pair-3.c: New test. + * gcc.target/powerpc/vector-pair-4.c: New test. + +==================== Branch work146-vpair, patch #1 (main branch) ==================== + +Power10: Add options to disable load and store vector pair. + +This is version 2 of the patch to add -mno-load-vector-pair and +-mno-store-vector-pair undocumented tuning switches. + +The differences between the first version of the patch and this version is that +I added explicit RTL abi attributes for when the compiler can generate the load +vector pair and store vector pair instructions. By having this attribute, the +movoo insn has separate alternatives for when we generate the instruction and +when we want to split the instruction into 2 separate vector loads or stores. + +In the first version of the patch, I had previously provided built-in functions +that would always generate load vector pair and store vector pair instructions +even if these instructions are normally disabled. I found these built-ins +weren't specified like the other vector pair built-ins, and I didn't include +documentation for the built-in functions. If we want such built-in functions, +we can add them as a separate patch later. + +In addition, since both versions of the patch adds #pragma target and attribute +support to change the results for individual functions, we can select on a +function by function basis what the defaults for load/store vector pair is. + +The original text for the patch is: + +In working on some future patches that involve utilizing vector pair +instructions, I wanted to be able to tune my program to enable or disable using +the vector pair load or store operations while still keeping the other +operations on the vector pair. + +This patch adds two undocumented tuning options. The -mno-load-vector-pair +option would tell GCC to generate two load vector instructions instead of a +single load vector pair. The -mno-store-vector-pair option would tell GCC to +generate two store vector instructions instead of a single store vector pair. + +If either -mno-load-vector-pair is used, GCC will not generate the indexed +stxvpx instruction. Similarly if -mno-store-vector-pair is used, GCC will not +generate the indexed lxvpx instruction. The reason for this is to enable +splitting the {,p}lxvp or {,p}stxvp instructions after reload without needing a +scratch GPR register. + +The default for -mcpu=power10 is that both load vector pair and store vector +pair are enabled. + +I added code so that the user code can modify these settings using either a +'#pragma GCC target' directive or used __attribute__((__target__(...))) in the +function declaration. + +I added tests for the switches, #pragma, and attribute options. + +I have built this on both little endian power10 systems and big endian power9 +systems doing the normal bootstrap and test. There were no regressions in any +of the tests, and the new tests passed. Can I check this patch into the master +branch? + +2023-11-17 Michael Meissner <meissner@linux.ibm.com> + +gcc/ + + * config/rs6000/mma.md (movoo): Add support for -mno-load-vector-pair and + -mno-store-vector-pair. + * config/rs6000/rs6000-cpus.def (OTHER_POWER10_MASKS): Add support for + -mload-vector-pair and -mstore-vector-pair. + (POWERPC_MASKS): Likewise. + * config/rs6000/rs6000.cc (rs6000_setup_reg_addr_masks): Only allow + indexed mode for OOmode if we are generating both load vector pair and + store vector pair instructions. + (rs6000_option_override_internal): Add support for -mno-load-vector-pair + and -mno-store-vector-pair. + (rs6000_opt_masks): Likewise. + * config/rs6000/rs6000.md (isa attribute): Add lxvp and stxvp + attributes. + (enabled attribute): Likewise. + * config/rs6000/rs6000.opt (-mload-vector-pair): New option. + (-mstore-vector-pair): Likewise. + +gcc/testsuite/ + + * gcc.target/powerpc/vector-pair-attribute.c: New test. + * gcc.target/powerpc/vector-pair-pragma.c: New test. + * gcc.target/powerpc/vector-pair-switch1.c: New test. + * gcc.target/powerpc/vector-pair-switch2.c: New test. + * gcc.target/powerpc/vector-pair-switch3.c: New test. + * gcc.target/powerpc/vector-pair-switch4.c: New test. + ==================== Branch work146-vpair, baseline ==================== +Add ChangeLog.meissner and REVISION. + +2023-11-17 Michael Meissner <meissner@linux.ibm.com> + +gcc/ + + * REVISION: New file for branch. + * ChangeLog.meissner: New file. + +gcc/c-family/ + + * ChangeLog.meissner: New file. + +gcc/c/ + + * ChangeLog.meissner: New file. + +gcc/cp/ + + * ChangeLog.meissner: New file. + +gcc/fortran/ + + * ChangeLog.meissner: New file. + +gcc/testsuite/ + + * ChangeLog.meissner: New file. + +libgcc/ + + * ChangeLog.meissner: New file. + 2023-11-17 Michael Meissner <meissner@linux.ibm.com> Clone branch
next reply other threads:[~2023-11-17 20:47 UTC|newest] Thread overview: 2+ messages / expand[flat|nested] mbox.gz Atom feed top 2023-11-17 20:47 Michael Meissner [this message] 2023-11-18 2:31 Michael Meissner
Reply instructions: You may reply publicly to this message via plain-text email using any one of the following methods: * Save the following mbox file, import it into your mail client, and reply-to-all from there: mbox Avoid top-posting and favor interleaved quoting: https://en.wikipedia.org/wiki/Posting_style#Interleaved_style * Reply using the --to, --cc, and --in-reply-to switches of git-send-email(1): git send-email \ --in-reply-to=20231117204748.808F83858D28@sourceware.org \ --to=meissner@gcc.gnu.org \ --cc=gcc-cvs@gcc.gnu.org \ /path/to/YOUR_REPLY https://kernel.org/pub/software/scm/git/docs/git-send-email.html * If your mail client supports setting the In-Reply-To header via mailto: links, try the mailto: linkBe sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).