public inbox for gcc-bugs@sourceware.org
From: "rguenth at gcc dot gnu.org" <gcc-bugzilla@gcc.gnu.org>
To: gcc-bugs@gcc.gnu.org
Subject: [Bug tree-optimization/108500] [11/12 Regression] -O -finline-small-functions results in "internal compiler error: Segmentation fault" on a very large program (700k function calls)
Date: Thu, 02 Feb 2023 10:22:55 +0000
Message-ID: <bug-108500-4-GM1K1pTq0l@http.gcc.gnu.org/bugzilla/>
In-Reply-To: <bug-108500-4@http.gcc.gnu.org/bugzilla/>

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108500

Richard Biener <rguenth at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |jamborm at gcc dot gnu.org,
                   |                            |vmakarov at gcc dot gnu.org

--- Comment #14 from Richard Biener <rguenth at gcc dot gnu.org> ---
Thanks for the new testcase.  With -O0 (and a --enable-checking=release
built compiler) this builds in ~11 minutes (on a Ryzen 9 7900X) with

 integrated RA            :  38.96 (  6%)   1.94 ( 20%)  42.00 (  6%)  3392M ( 23%)
 LRA non-specific         :  18.93 (  3%)   1.24 ( 13%)  23.78 (  4%)   450M (  3%)
 LRA virtuals elimination :   5.67 (  1%)   0.05 (  1%)   5.75 (  1%)   457M (  3%)
 LRA reload inheritance   : 318.25 ( 49%)   0.24 (  2%) 318.51 ( 48%)     0  (  0%)
 LRA create live ranges   : 199.24 ( 31%)   0.12 (  1%) 199.38 ( 30%)   228M (  2%)

645.67user 10.29system 11:04.42elapsed 98%CPU (0avgtext+0avgdata 30577844maxresident)k
3936200inputs+1091808outputs (122053major+10664929minor)pagefaults 0swaps

so register allocation taking all of the time.  There's maybe the
possibility to gate some of its features on the # of BBs or insns (or
whatever the actual "bad" thing is - I didn't look closer yet).

It also seems to use 30GB of peak memory at -O0 ...

For -O the situation is "better":

 tree PTA                 : 987.21 ( 99%)   0.41 ( 12%) 987.70 ( 99%)   128  (  0%)

992.56user 3.53system 16:36.20elapsed 99%CPU (0avgtext+0avgdata 2968740maxresident)k
42576inputs+8outputs (28major+717414minor)pagefaults 0swaps

which suggests a clear workaround, -fno-tree-pta, which makes it compile
in 5s for me.

Doing -O -finline-small-functions -fno-tree-pta we get a very high
compile-time in SRA's propagate_all_subaccesses, which probably sees a very
large struct copy chain

  tem1 = s2;
  s2 = tem1;
  tem2 = s2;
  s2 = tem2;
  ...

and somehow ends up quadratic (possibly switching the candidate_bitmap to
tree form at the start of propagate_all_subaccesses will help a bit).

Tree form bitmap doesn't help. I guess we end up queueing all elements in
the copy chain to the worklist and via the chains end up with an O(n^2)
working set.  The testcase can probably be shortened to get at this problem.

SRA is actually quite important here, so disabling SRA as a workaround
doesn't look to improve the situation a lot.  Still, with -fno-tree-sra
added we get good compile time and DCE/DSE remove all code, plus
-fno-tree-pta isn't required.

Martin, can you look at the SRA issue?  Do you want me to create a separate
bug report for this?  The IL into SRA looks like

  <bb 2> :
  s2D.2755 = {};
  s1D.2756 = {};
  _unusedD.2002766 = s1D.2756;
  sD.2002767 = s2D.2755;
  s2D.2755 = sD.2002767;
  _unusedD.2002766 ={v} {CLOBBER(eol)};
  sD.2002767 ={v} {CLOBBER(eol)};
  _unusedD.2002764 = s1D.2756;
  sD.2002765 = s2D.2755;
  s2D.2755 = sD.2002765;
  _unusedD.2002764 ={v} {CLOBBER(eol)};
  sD.2002765 ={v} {CLOBBER(eol)};
  _unusedD.2002762 = s1D.2756;
  sD.2002763 = s2D.2755;
  s2D.2755 = sD.2002763;
  _unusedD.2002762 ={v} {CLOBBER(eol)};
  sD.2002763 ={v} {CLOBBER(eol)};
  _unusedD.2002760 = s1D.2756;
  sD.2002761 = s2D.2755;
  s2D.2755 = sD.2002761;
  _unusedD.2002760 ={v} {CLOBBER(eol)};
  sD.2002761 ={v} {CLOBBER(eol)};
  ...
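
For anyone who wants a smaller input with the same shape, the following is a rough sketch of a generator, not the reporter's actual testcase. It assumes the IL above comes from inlining a small function that takes two structs by value and returns the second one; the names `emit_copy_chain`, `pass` and the two-int layout of `struct S` are made up for illustration, while `s1`, `s2`, `s` and `_unused` follow the names in the dump.

```c
#include <stdio.h>

/* Hypothetical generator (not part of GCC or of the original testcase).
   Writes a C translation unit to OUT in which main() performs N_CALLS
   consecutive calls of the form "s2 = pass(s1, s2);".  Once pass() is
   inlined, each call should become the by-value copies
   "_unused = s1; s = s2; s2 = s;" seen in the IL dump.
   Returns N_CALLS on success, -1 on bad arguments or a write error.  */
long emit_copy_chain(FILE *out, long n_calls)
{
    long i;

    if (out == NULL || n_calls < 0)
        return -1;

    /* Assumed struct layout and helper; the real testcase may differ.  */
    fputs("struct S { int a; int b; };\n"
          "static struct S pass(struct S _unused, struct S s) { return s; }\n"
          "int main(void)\n"
          "{\n"
          "    struct S s1 = {0}, s2 = {0};\n", out);

    /* One struct round-trip through pass() per iteration.  */
    for (i = 0; i < n_calls; i++)
        fputs("    s2 = pass(s1, s2);\n", out);

    fputs("    return 0;\n}\n", out);

    return ferror(out) ? -1 : n_calls;
}
```

Calling `emit_copy_chain(stdout, 700000)` from a small driver and compiling the result with -O -finline-small-functions -fno-tree-pta should produce a copy chain like the one above. Whether it shows the same SRA slowdown has not been checked, and the call count may need adjusting.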