From: "Kewen.Lin" <linkw@linux.ibm.com>
To: "Bin.Cheng" <amker.cheng@gmail.com>
Cc: Segher Boessenkool <segher@kernel.crashing.org>,
GCC Patches <gcc-patches@gcc.gnu.org>,
"bin.cheng" <bin.cheng@linux.alibaba.com>,
Bill Schmidt <wschmidt@linux.ibm.com>,
Richard Guenther <rguenther@suse.de>,
Jakub Jelinek <jakub@redhat.com>
Subject: Re: [PATCH, RFC, rs6000] PR80791 Consider doloop in ivopts
Date: Sun, 05 May 2019 05:26:00 -0000 [thread overview]
Message-ID: <43ad1524-3592-c2f4-57c4-a44795330658@linux.ibm.com> (raw)
In-Reply-To: <CAHFci297crdbwqk7HnYDz5bz9SmDPsx7cYfX0EADnbCZdtkcWA@mail.gmail.com>
on 2019/4/27 ä¸å11:44, Bin.Cheng wrote:
> On Fri, Apr 26, 2019 at 2:44 PM Kewen.Lin <linkw@linux.ibm.com> wrote:
>> +
>> + /* zero cost use makes it easier to select memory based iv cand
>> + for replacement of non memory based iv and its use. But if
>> + the setup sequence are too costly, loop iv analysis can NOT
>> + easily figure out it's finite, it's possible to stop the
>> + low-overhead loop transformation and get unexpected code. */
>> + if (use->zero_cost_p && cand->iv->base_object && !use->iv->base_object
>> + && elim_cost.cost >= 30)
>> + dont_elim_p = true;
> No, we'd like to avoid such things in general. The conditions look
> like a hack to me. elim_cost is compared to express_cost, adding
> another check on it at different place isn't really good, especially
> 30 itself is a magic number. It's most likely improvement in some
> cases, deterioration in others.
>
Yes, I agree it's too hacky and unacceptable as a formal fix. And I
tried to investigate more whether it's a general issue but never got
exposed.
> Also it punishes one pass (IVOPTs here) because of other pass' (RTL)
> problem. It does't mean we can't do such transformations, only it has
> to be as precise/conservative as possible. For example, if RTL loop
> iv is improved to handle the case in the future, who would remember to
> come back and adjust this?
>
Good question!
> GCC lacks the capability passing information to later passes. Gimple
> analyzer worked hard collecting various information but discards it
> entering RTL or earlier. Other examples are like runtime alias
> information, non-wrapping information for specific operations, etc.
> IMHO, this is what needs to be done. As for this case, it could be
> finite loop info, or non-wrapping info of the iv_var's increment
> operation. By passing more information, RTL passes can be simplified
> too.
>
Thanks for the information! Is there any under development work for this?
That would be fine if we can pass down those information to downstream
passes based on upcoming feature.
> Thanks,
> bin
>> +
>> /* The bound is a loop invariant, so it will be only computed
>> once. */
>> elim_cost.cost = adjust_setup_cost (data, elim_cost.cost);
>> @@ -5184,7 +5195,7 @@ determine_group_iv_cost_cond (struct ivopts_data *data,
>> express_cost += bound_cost;
>>
>> /* Choose the better approach, preferring the eliminated IV. */
>> - if (elim_cost <= express_cost)
>> + if (elim_cost <= express_cost && !dont_elim_p)
>> {
>>
>>
>> I was thinking whether this zero cost change just exposed
>> an existing problem, then this kind of check should be for all
>> cases not only for zero cost use, similar to
>> expression_expensive_p? But why doesn't it happen before?
>> Need more investigation.
>>
>>>
>>>> Btw, this is for GCC10.
>>>
>>> *Phew* ;-)
>>>
>>>
>>> Some trivial comments:
>>>
>>>> +static bool
>>>> +invalid_insn_for_doloop_p (struct loop *loop)
>>>> +{
>>>> + basic_block *body = get_loop_body (loop);
>>>> + unsigned num_nodes = loop->num_nodes;
>>>> + gimple_stmt_iterator gsi;
>>>> + unsigned i;
>>>> +
>>>> + for (i = 0; i < num_nodes; i++)
>>>
>>> for (unsigned i = 0; i < num_nodes; i++)
>>>
>>> (and maybe you can just say loop->num_nodes here; I don't know if that
>>> generates worse code, or if that even matters).
>>
>> Good idea, will fix it.
>>
>>>
>>>> + if (dump_file && (dump_flags & TDF_DETAILS))
>>>> + fprintf (
>>>> + dump_file,
>>>> + "predict doloop failure due to finding computed jump.\n");
>>>
>>> We don't normally end lines in (. There are other solutions to why you
>>> did that here; you can use string pasting, to break the string into two,
>>> or factor out (some part of) the loop body to reduce indentation.
>>>
>>
>> Will adjust it.
>>
>>>
>>> Segher
>>>
>>
>
next prev parent reply other threads:[~2019-05-05 5:26 UTC|newest]
Thread overview: 22+ messages / expand[flat|nested] mbox.gz Atom feed top
2019-04-24 8:49 Kewen.Lin
2019-04-24 9:08 ` Jakub Jelinek
2019-04-24 9:25 ` Kewen.Lin
2019-04-25 12:45 ` Segher Boessenkool
2019-04-26 6:58 ` Kewen.Lin
2019-04-26 7:59 ` Richard Biener
2019-04-26 14:32 ` Kewen.Lin
2019-04-26 17:06 ` Segher Boessenkool
2019-04-26 18:23 ` Richard Biener
2019-04-26 18:58 ` Segher Boessenkool
2019-05-05 3:42 ` Kewen.Lin
2019-04-26 16:44 ` Segher Boessenkool
2019-04-27 4:13 ` Bin.Cheng
2019-05-05 5:26 ` Kewen.Lin [this message]
2019-05-06 10:21 ` Richard Biener
2019-04-27 3:45 ` Bin.Cheng
2019-05-05 3:23 ` Kewen.Lin
2019-05-05 4:04 ` Bin.Cheng
2019-05-05 6:02 ` Kewen.Lin
2019-05-06 1:50 ` Bin.Cheng
2019-05-06 7:31 ` Kewen.Lin
2019-05-05 16:33 ` Segher Boessenkool
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=43ad1524-3592-c2f4-57c4-a44795330658@linux.ibm.com \
--to=linkw@linux.ibm.com \
--cc=amker.cheng@gmail.com \
--cc=bin.cheng@linux.alibaba.com \
--cc=gcc-patches@gcc.gnu.org \
--cc=jakub@redhat.com \
--cc=rguenther@suse.de \
--cc=segher@kernel.crashing.org \
--cc=wschmidt@linux.ibm.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).