public inbox for gcc-bugs@sourceware.org help / color / mirror / Atom feed
From: "rguenth at gcc dot gnu.org" <gcc-bugzilla@gcc.gnu.org> To: gcc-bugs@gcc.gnu.org Subject: [Bug tree-optimization/108352] [13 Regression] Dead Code Elimination Regression at -O2 since r13-1960-gd86d81a449c036 Date: Wed, 11 Jan 2023 11:14:37 +0000 [thread overview] Message-ID: <bug-108352-4-n2gIHmWEp8@http.gcc.gnu.org/bugzilla/> (raw) In-Reply-To: <bug-108352-4@http.gcc.gnu.org/bugzilla/> https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108352 Richard Biener <rguenth at gcc dot gnu.org> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |law at gcc dot gnu.org --- Comment #2 from Richard Biener <rguenth at gcc dot gnu.org> --- Checking profitability of path (backwards): bb:3 (6 insns) bb:9 (0 insns) (latch) bb:5 Control statement insns: 2 Overall: 4 insns [4] Registering jump thread: (5, 9) incoming edge; (9, 3) normal (back) (3, 4) nocopy; path: 5->9->3->4 SUCCESS but Checking profitability of path (backwards): bb:3 (6 insns) bb:9 (latch) Control statement insns: 2 Overall: 4 insns FAIL: Would create irreducible loop without threading multiway branch. path: 9->3->xx REJECTED we are no longer considering the first which just adds an unrelated jump to the path after the patch. That's the /* We avoid creating irreducible inner loops unless we thread through a multiway branch, in which case we have deemed it worth losing other loop optimizations later. We also consider it worth creating an irreducible inner loop if the number of copied statement is low relative to the length of the path -- in that case there's little the traditional loop optimizer would have done anyway, so an irreducible loop is not so bad. */ if (!threaded_multiway_branch && creates_irreducible_loop && *creates_irreducible_loop && (n_insns * (unsigned) param_fsm_scale_path_stmts > (m_path.length () * (unsigned) param_fsm_scale_path_blocks))) { if (dump_file && (dump_flags & TDF_DETAILS)) fprintf (dump_file, " FAIL: Would create irreducible loop without threading " "multiway branch.\n"); return false; heuristic which with 9 -> 3 is 4 * 2 > 2 * 3 but with 5 -> 9 -> 3 we get 4 * 2 > 3 * 3. It's also worth noting that neither of the two threads create an irreducible loop in the end for this particular case since e is also constant on entry and thus the jump is resolved and the extra loop entry is removed (but that's out of scope of the threaders analysis here). It IMHO still makes no sense to reject the shorter path over the longer one so the above "heuristic" makes absolutely no sense to me. Raising --param fsm-scale-path-blocks to 4 "fixes" the testcase on trunk. The heuristic was added in r6-6600-g2b572b3c213b51 by Jeff in the attempt to address a coremark regression (PR68398). I guess Jeff remembers nothing about this. Note this is not about adding inner irreducible loops but making loop itself irreducible. The length of the path itself also says nothing about the length of a path through the irreducible loop ... Reverting the heuristic will reject all non-multi-way branch irreducible loop creation. We have another heuristic that rejects threading through the latch early: /* Threading through an empty latch would cause code to be added to the latch. This could alter the loop form sufficiently to cause loop optimizations to fail. Disable these threads until after loop optimizations have run. */ if ((threaded_through_latch || (taken_edge && taken_edge->dest == loop->latch)) && !(cfun->curr_properties & PROP_loop_opts_done) && empty_block_p (loop->latch)) so we could reject irreducible loops before loop opts (w/o just covering the empty latch case) and otherwise generally allow it even for non-multi-way branches. That said, I fear I'm going to replace one bogus heuristic with another ;) I'm still going to test replacing the heuristic with the following (which allows to remove the fsm-scale-path-blocks param). /* We avoid creating irreducible inner loops unless we thread through a multiway branch, in which case we have deemed it worth losing other loop optimizations later. We also consider it worth creating an irreducible inner loop after loop optimizations if the number of copied statement is low. */ if (!m_threaded_multiway_branch && *creates_irreducible_loop && (!(cfun->curr_properties & PROP_loop_opts_done) || (m_n_insns * param_fsm_scale_path_stmts >= param_max_jump_thread_duplication_stmts))) { if (dump_file && (dump_flags & TDF_DETAILS)) fprintf (dump_file, " FAIL: Would create irreducible loop early without " "threading multiway branch.\n"); /* We compute creates_irreducible_loop only late. */ return false; }
next prev parent reply other threads:[~2023-01-11 11:14 UTC|newest] Thread overview: 6+ messages / expand[flat|nested] mbox.gz Atom feed top 2023-01-10 12:21 [Bug tree-optimization/108352] New: Dead Code Elimination Regression at -O2 (trunk vs. 12.2.0) yann at ywg dot ch 2023-01-10 14:42 ` [Bug tree-optimization/108352] Dead Code Elimination Regression at -O2 since r13-1960-gd86d81a449c036 marxin at gcc dot gnu.org 2023-01-10 16:26 ` [Bug tree-optimization/108352] [13 Regression] " rguenth at gcc dot gnu.org 2023-01-11 11:14 ` rguenth at gcc dot gnu.org [this message] 2023-01-11 11:59 ` cvs-commit at gcc dot gnu.org 2023-01-11 12:07 ` rguenth at gcc dot gnu.org
Reply instructions: You may reply publicly to this message via plain-text email using any one of the following methods: * Save the following mbox file, import it into your mail client, and reply-to-all from there: mbox Avoid top-posting and favor interleaved quoting: https://en.wikipedia.org/wiki/Posting_style#Interleaved_style * Reply using the --to, --cc, and --in-reply-to switches of git-send-email(1): git send-email \ --in-reply-to=bug-108352-4-n2gIHmWEp8@http.gcc.gnu.org/bugzilla/ \ --to=gcc-bugzilla@gcc.gnu.org \ --cc=gcc-bugs@gcc.gnu.org \ /path/to/YOUR_REPLY https://kernel.org/pub/software/scm/git/docs/git-send-email.html * If your mail client supports setting the In-Reply-To header via mailto: links, try the mailto: linkBe sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).