From: "Kewen.Lin" <linkw@linux.ibm.com>
To: GCC Patches <gcc-patches@gcc.gnu.org>
Cc: Bill Schmidt <wschmidt@linux.ibm.com>,
Segher Boessenkool <segher@kernel.crashing.org>
Subject: PING^5 [PATCH v2] combine: Tweak the condition of last_set invalidation
Date: Wed, 20 Oct 2021 17:28:42 +0800 [thread overview]
Message-ID: <b06937d1-39a8-446f-ade7-3962cd8100cb@linux.ibm.com> (raw)
In-Reply-To: <feaf6eaa-2cf3-d7fd-082a-4581fd6de9b3@linux.ibm.com>
Hi,
Gentle ping this:
https://gcc.gnu.org/pipermail/gcc-patches/2021-June/572555.html
BR,
Kewen
>
>>>> on 2021/6/11 下午9:16, Kewen.Lin via Gcc-patches wrote:
>>>>> Hi Segher,
>>>>>
>>>>> Thanks for the review!
>>>>>
>>>>> on 2021/6/10 上午4:17, Segher Boessenkool wrote:
>>>>>> Hi!
>>>>>>
>>>>>> On Wed, Dec 16, 2020 at 04:49:49PM +0800, Kewen.Lin wrote:
>>>>>>> Currently we have the check:
>>>>>>>
>>>>>>> if (!insn
>>>>>>> || (value && rsp->last_set_table_tick >= label_tick_ebb_start))
>>>>>>> rsp->last_set_invalid = 1;
>>>>>>>
>>>>>>> which means if we want to record some value for some reg and
>>>>>>> this reg got refered before in a valid scope,
>>>>>>
>>>>>> If we already know it is *set* in this same extended basic block.
>>>>>> Possibly by the same instruction btw.
>>>>>>
>>>>>>> we invalidate the
>>>>>>> set of reg (last_set_invalid to 1). It avoids to find the wrong
>>>>>>> set for one reg reference, such as the case like:
>>>>>>>
>>>>>>> ... op regX // this regX could find wrong last_set below
>>>>>>> regX = ... // if we think this set is valid
>>>>>>> ... op regX
>>>>>>
>>>>>> Yup, exactly.
>>>>>>
>>>>>>> But because of retry's existence, the last_set_table_tick could
>>>>>>> be set by some later reference insns, but we see it's set due
>>>>>>> to retry on the set (for that reg) insn again, such as:
>>>>>>>
>>>>>>> insn 1
>>>>>>> insn 2
>>>>>>>
>>>>>>> regX = ... --> (a)
>>>>>>> ... op regX --> (b)
>>>>>>>
>>>>>>> insn 3
>>>>>>>
>>>>>>> // assume all in the same BB.
>>>>>>>
>>>>>>> Assuming we combine 1, 2 -> 3 sucessfully and replace them as two
>>>>>>> (3 insns -> 2 insns),
>>>>>>
>>>>>> This will delete insn 1 and write the combined result to insns 2 and 3.
>>>>>>
>>>>>>> retrying from insn1 or insn2 again:
>>>>>>
>>>>>> Always 2, but your point remains valid.
>>>>>>
>>>>>>> it will scan insn (a) again, the below condition holds for regX:
>>>>>>>
>>>>>>> (value && rsp->last_set_table_tick >= label_tick_ebb_start)
>>>>>>>
>>>>>>> it will mark this set as invalid set. But actually the
>>>>>>> last_set_table_tick here is set by insn (b) before retrying, so it
>>>>>>> should be safe to be taken as valid set.
>>>>>>
>>>>>> Yup.
>>>>>>
>>>>>>> This proposal is to check whether the last_set_table safely happens
>>>>>>> after the current set, make the set still valid if so.
>>>>>>
>>>>>>> Full SPEC2017 building shows this patch gets more sucessful combines
>>>>>>> from 1902208 to 1902243 (trivial though).
>>>>>>
>>>>>> Do you have some example, or maybe even a testcase? :-)
>>>>>>
>>>>>
>>>>> Sorry for the late reply, it took some time to get one reduced case.
>>>>>
>>>>> typedef struct SA *pa_t;
>>>>>
>>>>> struct SC {
>>>>> int h;
>>>>> pa_t elem[];
>>>>> };
>>>>>
>>>>> struct SD {
>>>>> struct SC *e;
>>>>> };
>>>>>
>>>>> struct SA {
>>>>> struct {
>>>>> struct SD f[1];
>>>>> } g;
>>>>> };
>>>>>
>>>>> void foo(pa_t *k, char **m) {
>>>>> int l, i;
>>>>> pa_t a;
>>>>> l = (int)a->g.f[5].e;
>>>>> i = 0;
>>>>> for (; i < l; i++) {
>>>>> k[i] = a->g.f[5].e->elem[i];
>>>>> m[i] = "";
>>>>> }
>>>>> }
>>>>>
>>>>> Baseline is r12-0 and the option is "-O3 -mcpu=power9 -fno-strict-aliasing",
>>>>> with this patch, the generated assembly can save two rlwinm s.
>>>>>
>>>>>>> + /* Record the luid of the insn whose expression involving register n. */
>>>>>>> +
>>>>>>> + int last_set_table_luid;
>>>>>>
>>>>>> "Record the luid of the insn for which last_set_table_tick was set",
>>>>>> right?
>>>>>>
>>>>>
>>>>> But it can be updated later to one smaller luid, how about the wording like:
>>>>>
>>>>>
>>>>> + /* Record the luid of the insn which uses register n, the insn should
>>>>> + be the first one using register n in that block of the insn which
>>>>> + last_set_table_tick was set for. */
>>>>>
>>>>>
>>>>>>> -static void update_table_tick (rtx);
>>>>>>> +static void update_table_tick (rtx, int);
>>>>>>
>>>>>> Please remove this declaration instead, the function is not used until
>>>>>> after its actual definition :-)
>>>>>>
>>>>>
>>>>> Done.
>>>>>
>>>>>>> @@ -13243,7 +13247,21 @@ update_table_tick (rtx x)
>>>>>>> for (r = regno; r < endregno; r++)
>>>>>>> {
>>>>>>> reg_stat_type *rsp = ®_stat[r];
>>>>>>> - rsp->last_set_table_tick = label_tick;
>>>>>>> + if (rsp->last_set_table_tick >= label_tick_ebb_start)
>>>>>>> + {
>>>>>>> + /* Later references should not have lower ticks. */
>>>>>>> + gcc_assert (label_tick >= rsp->last_set_table_tick);
>>>>>>
>>>>>> This should be obvious, but checking it won't hurt, okay.
>>>>>>
>>>>>>> + /* Should pick up the lowest luid if the references
>>>>>>> + are in the same block. */
>>>>>>> + if (label_tick == rsp->last_set_table_tick
>>>>>>> + && rsp->last_set_table_luid > insn_luid)
>>>>>>> + rsp->last_set_table_luid = insn_luid;
>>>>>>
>>>>>> Why? Is it conservative for the check you will do later? Please spell
>>>>>> this out, it is crucial!
>>>>>>
>>>>>
>>>>> Since later the combinations involving this insn probably make the
>>>>> register be used in one insn sitting ahead (which has smaller luid than
>>>>> the one which was recorded before). Yes, it's very conservative, this
>>>>> ensure that we always use the luid of the insn which is the first insn
>>>>> using this register in the block. The last_set invalidation is going
>>>>> to catch the case like:
>>>>>
>>>>> ... regX // avoid the set used here ...
>>>>> regX = ...
>>>>> ...
>>>>>
>>>>> Once we have the smallest luid one of all insns which use register X,
>>>>> any unsafe regX sets should be caught.
>>>>>
>>>>> I updated the comments to:
>>>>>
>>>>> + /* Since combination may generate some instructions
>>>>> + to replace some foregoing instructions with the
>>>>> + references to register r (using register r), we
>>>>> + need to make sure we record the first instruction
>>>>> + which is using register r, so always update with
>>>>> + the lowest luid here. If the given set happens
>>>>> + before this recorded earliest reference, the set
>>>>> + value should be safe to be used. */
>>>>>
>>>>>>> @@ -13359,7 +13378,10 @@ record_value_for_reg (rtx reg, rtx_insn *insn, rtx value)
>>>>>>>
>>>>>>> /* Mark registers that are being referenced in this value. */
>>>>>>> if (value)
>>>>>>> - update_table_tick (value);
>>>>>>> + {
>>>>>>> + gcc_assert (insn);
>>>>>>> + update_table_tick (value, DF_INSN_LUID (insn));
>>>>>>> + }
>>>>>>
>>>>>> Don't add that assert please. If you really want one it should come
>>>>>> right at the start of the function, not 60 lines later :-)
>>>>>>
>>>>>
>>>>> Exactly, fixed.
>>>>>
>>>>>> Looks good if I understood this correctly :-)
>>>>>>
>>>>>>
>>>>>
>>>>> Thanks again, I also updated the comments in func record_value_for_reg,
>>>>> the new version is attached.
>>>>>
>>>>> BR,
>>>>> Kewen
>>>>> -----
>>>>> gcc/ChangeLog:
>>>>>
>>>>> * combine.c (struct reg_stat_type): New member
>>>>> last_set_table_luid.
>>>>> (update_table_tick): Add one argument for insn luid and
>>>>> set last_set_table_luid with it, remove its declaration.
>>>>> (record_value_for_reg): Adjust the condition to set
>>>>> last_set_invalid nonzero.
>>>>>
next prev parent reply other threads:[~2021-10-20 9:28 UTC|newest]
Thread overview: 20+ messages / expand[flat|nested] mbox.gz Atom feed top
2020-12-16 8:49 [PATCH/RFC] " Kewen.Lin
2021-01-14 2:29 ` PING^1 " Kewen.Lin
2021-01-15 0:22 ` Segher Boessenkool
2021-01-15 8:06 ` Kewen.Lin
2021-01-22 0:30 ` Segher Boessenkool
2021-01-22 2:21 ` Kewen.Lin
2021-05-07 2:45 ` Kewen.Lin
2021-05-26 3:04 ` PING^2 " Kewen.Lin
2021-06-09 2:32 ` PING^3 " Kewen.Lin
2021-06-09 20:17 ` Segher Boessenkool
2021-06-11 13:16 ` [PATCH v2] " Kewen.Lin
2021-06-28 7:00 ` PING^1 " Kewen.Lin
2021-07-15 2:00 ` PING^2 " Kewen.Lin
2021-09-08 7:03 ` PING^3 " Kewen.Lin
2021-10-13 2:27 ` PING^4 " Kewen.Lin
2021-10-20 9:28 ` Kewen.Lin [this message]
2021-11-04 10:56 ` PING^6 " Kewen.Lin
2021-11-22 2:22 ` PING^7 " Kewen.Lin
2021-11-29 22:28 ` Segher Boessenkool
2021-11-30 8:47 ` Kewen.Lin
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=b06937d1-39a8-446f-ade7-3962cd8100cb@linux.ibm.com \
--to=linkw@linux.ibm.com \
--cc=gcc-patches@gcc.gnu.org \
--cc=segher@kernel.crashing.org \
--cc=wschmidt@linux.ibm.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).