public inbox for gcc-bugs@sourceware.org help / color / mirror / Atom feed
From: "rguenth at gcc dot gnu.org" <gcc-bugzilla@gcc.gnu.org> To: gcc-bugs@gcc.gnu.org Subject: [Bug tree-optimization/44563] GCC uses a lot of RAM when compiling a large numbers of functions Date: Mon, 09 Mar 2015 15:36:00 -0000 [thread overview] Message-ID: <bug-44563-4-tkz3CnQoqp@http.gcc.gnu.org/bugzilla/> (raw) In-Reply-To: <bug-44563-4@http.gcc.gnu.org/bugzilla/> https://gcc.gnu.org/bugzilla/show_bug.cgi?id=44563 --- Comment #18 from Richard Biener <rguenth at gcc dot gnu.org> --- (In reply to Richard Biener from comment #17) > (In reply to Richard Biener from comment #16) > > callgrind shows the cgraph_edge_hasher quite high in the profile (via > > redirect_all_calls). I suppose as the large main is a single BB walking > > all stmts over-and-over is quite bad. Also hash_pointer isn't inlined!? > > Ah - it's external in libiberty hashtab.c ... - it should transition to > > using/inheriting from pointer_hash. > > > > cgraph_edge * > > cgraph_node::get_edge (gimple call_stmt) > > { > > cgraph_edge *e, *e2; > > int n = 0; > > > > if (call_site_hash) > > return call_site_hash->find_with_hash (call_stmt, > > htab_hash_pointer (call_stmt)); > > > > Btw, for 10000 calls (smaller testcase) we get 100 000 000 calls to > cgraph_edge::redirect_call_stmt_to_callee () (that's from 40000 > redirect_all_calls calls which is from 10000 optimize_inline_calls calls). > > Ah - we do this also for the ENTRY/EXIT block! > > Index: gcc/tree-inline.c > =================================================================== > --- gcc/tree-inline.c (revision 221278) > +++ gcc/tree-inline.c (working copy) > @@ -2802,11 +2802,13 @@ copy_cfg_body (copy_body_data * id, gcov > if (need_debug_cleanup > && bb->index != ENTRY_BLOCK > && bb->index != EXIT_BLOCK) > - maybe_move_debug_stmts_to_successors (id, (basic_block) bb->aux); > - /* Update call edge destinations. This can not be done before loop > - info is updated, because we may split basic blocks. */ > - if (id->transform_call_graph_edges == CB_CGE_DUPLICATE) > - redirect_all_calls (id, (basic_block)bb->aux); > + { > + maybe_move_debug_stmts_to_successors (id, (basic_block) bb->aux); > + /* Update call edge destinations. This can not be done before > loop > + info is updated, because we may split basic blocks. */ > + if (id->transform_call_graph_edges == CB_CGE_DUPLICATE) > + redirect_all_calls (id, (basic_block)bb->aux); > + } > ((basic_block)bb->aux)->aux = NULL; > bb->aux = NULL; > } > > makes sense? Fails to bootstrap :/ But would improve the testcase to only have the inline heuristic issue. /space/rguenther/src/svn/trunk/libstdc++-v3/libsupc++/pbase_type_info.cc: In member function ‘virtual bool __cxxabiv1::__pbase_type_info::__do_catch(const std::type_info*, void**, unsigned int) const’: /space/rguenther/src/svn/trunk/libstdc++-v3/libsupc++/pbase_type_info.cc:32:6: error: reference to dead statement bool __pbase_type_info:: ^ # .MEM = VDEF <.MEM> _30 = OBJ_TYPE_REF(_28;(const struct __pbase_type_info)this_3(D)->6) (this_3(D), thr_type_5(D), thr_obj_9(D), outer_29); _ZNK10__cxxabiv117__pbase_type_info10__do_catchEPKSt9type_infoPPvj/74 (virtual bool __cxxabiv1::__pbase_type_info::__do_catch(const std::type_info*, void**, unsigned int) const) @0x2aaaac8d3ab8 Type: function definition analyzed Visibility: externally_visible public visibility_specified virtual Address is taken. References: _ZNK10__cxxabiv117__pbase_type_info15__pointer_catchEPKS0_PPvj/34 (addr) (speculative) Referring: _ZTVN10__cxxabiv117__pbase_type_infoE/77 (addr) Availability: overwritable First run: 0 Function flags: body Called by: Calls: strcmp/85 (0.39 per call) __cxa_bad_typeid/83 (can throw external) strcmp/85 (0.61 per call) Indirect call(0.11 per call) (can throw external) Polymorphic indirect call of type const struct __pbase_type_info token:6(speculative) (0.03 per call) (can throw external) of param:0 Outer type (dynamic):struct __pbase_type_info (or a derived type) offset 0 /space/rguenther/src/svn/trunk/libstdc++-v3/libsupc++/pbase_type_info.cc:32:6: internal compiler error: verify_cgraph_node failed 0xa8eebc cgraph_node::verify_node() /space/rguenther/src/svn/trunk/gcc/cgraph.c:3115 0xa8473f symtab_node::verify() /space/rguenther/src/svn/trunk/gcc/symtab.c:1103 0x1025861 optimize_inline_calls(tree_node*) /space/rguenther/src/svn/trunk/gcc/tree-inline.c:4938 > > The estimate_calls_size_and_time portion is quite smaller. > > > > cleanup-cfgs main portion is split_bb_on_noreturn_calls. >From gcc-bugs-return-479823-listarch-gcc-bugs=gcc.gnu.org@gcc.gnu.org Mon Mar 09 15:55:22 2015 Return-Path: <gcc-bugs-return-479823-listarch-gcc-bugs=gcc.gnu.org@gcc.gnu.org> Delivered-To: listarch-gcc-bugs@gcc.gnu.org Received: (qmail 130953 invoked by alias); 9 Mar 2015 15:55:22 -0000 Mailing-List: contact gcc-bugs-help@gcc.gnu.org; run by ezmlm Precedence: bulk List-Id: <gcc-bugs.gcc.gnu.org> List-Archive: <http://gcc.gnu.org/ml/gcc-bugs/> List-Post: <mailto:gcc-bugs@gcc.gnu.org> List-Help: <mailto:gcc-bugs-help@gcc.gnu.org> Sender: gcc-bugs-owner@gcc.gnu.org Delivered-To: mailing list gcc-bugs@gcc.gnu.org Received: (qmail 130855 invoked by uid 48); 9 Mar 2015 15:55:16 -0000 From: "derodat at adacore dot com" <gcc-bugzilla@gcc.gnu.org> To: gcc-bugs@gcc.gnu.org Subject: [Bug debug/53927] wrong value for DW_AT_static_link Date: Mon, 09 Mar 2015 15:55:00 -0000 X-Bugzilla-Reason: CC X-Bugzilla-Type: changed X-Bugzilla-Watch-Reason: None X-Bugzilla-Product: gcc X-Bugzilla-Component: debug X-Bugzilla-Version: 4.6.3 X-Bugzilla-Keywords: X-Bugzilla-Severity: normal X-Bugzilla-Who: derodat at adacore dot com X-Bugzilla-Status: NEW X-Bugzilla-Priority: P3 X-Bugzilla-Assigned-To: unassigned at gcc dot gnu.org X-Bugzilla-Target-Milestone: --- X-Bugzilla-Flags: X-Bugzilla-Changed-Fields: Message-ID: <bug-53927-4-sGp0FNDm4B@http.gcc.gnu.org/bugzilla/> In-Reply-To: <bug-53927-4@http.gcc.gnu.org/bugzilla/> References: <bug-53927-4@http.gcc.gnu.org/bugzilla/> Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: 7bit X-Bugzilla-URL: http://gcc.gnu.org/bugzilla/ Auto-Submitted: auto-generated MIME-Version: 1.0 X-SW-Source: 2015-03/txt/msg00967.txt.bz2 Content-length: 3119 https://gcc.gnu.org/bugzilla/show_bug.cgi?idS927 --- Comment #21 from Pierre-Marie de Rodat <derodat at adacore dot com> --- (In reply to Eric Botcazou from comment #18) > I think this is worth investigating though because it's conceptually > much simpler than adding yet another indirection. And we should > concentrate on -O0 (and -Og), we don't really care about what happens > with aggressive optimization. Understood and agreed. Nevertheless... > I guess the question is: can we arrange to have a constant offset > between the frame base and the FRAME object, "constant" meaning valid > for every function but possibly target-dependent? I started to hack into cfgexpand.c and dwarf2out.c, but I realized this is not possible in the general case. Consider the following example: #include <stdlib.h> int nestee (void) { int a __attribute__((aligned(64))) = 1234; void nested (int b) { a = b; } nested (2345); return a; } int call_nestee (int n) { int *v = alloca (sizeof (int) * n); v[0] = nestee (); return v[0]; } int main (void) { call_nestee (1); call_nestee (8); return 0; } With a GCC 5.0 built from fairly recent sources, I get the following CFA information: 00000090 000000000000002c 00000064 FDE cie\0000030 pc\0000000004004ac..00000000004004eb DW_CFA_advance_loc: 5 to 00000000004004b1 DW_CFA_def_cfa: r10 (r10) ofs 0 DW_CFA_advance_loc: 9 to 00000000004004ba DW_CFA_expression: r6 (rbp) (DW_OP_breg6 (rbp): 0) DW_CFA_advance_loc: 5 to 00000000004004bf DW_CFA_def_cfa_expression (DW_OP_breg6 (rbp): -8; DW_OP_deref) DW_CFA_advance_loc: 38 to 00000000004004e5 And now here is what I get under GDB: $ gdb -n -q -ex 'b nestee' ./dyn_frame Reading symbols from ./dyn_frame...done. Breakpoint 1 at 0x4004c3: file dyn_frame.c, line 6. (gdb) r [...] Breakpoint 1, nestee () at dyn_frame.c:6 6 int a __attribute__((aligned(64))) = 1234; (gdb) p $pc $1 = (void (*)()) 0x4004c3 <nestee+23> (gdb) x/1xg $rbp - 8 0x7fffffffdf28: 0x00007fffffffdf60 (gdb) p/x (char *) 0x00007fffffffdf60 - (char *) &a $2 = 0xa0 ... so for this frame, the CFA and the FRAME object are 0xa0 bytes from each other. Now let's resume to see the next "nestee" frame: (gdb) c Continuing. Breakpoint 1, nestee () at dyn_frame.c:6 6 int a __attribute__((aligned(64))) = 1234; (gdb) p $pc $3 = (void (*)()) 0x4004c3 <nestee+23> (gdb) x/1xg $rbp - 8 0x7fffffffdf28: 0x00007fffffffdf50 (gdb) p/x (char *) 0x00007fffffffdf50 - (char *) &a $4 = 0x90 The offset between the CFA and e FRAME object is now 0x90 bytes. So because of alignment constraints, I think we cannot assume we can have a constant offset (even function-dependent). This offset is dynamic and the only way to compute it is to use the frame's context: here, nestee's saved registers, which we don't have access to in DWARF when computing the static link attribute.
next prev parent reply other threads:[~2015-03-09 15:36 UTC|newest] Thread overview: 33+ messages / expand[flat|nested] mbox.gz Atom feed top [not found] <bug-44563-4@http.gcc.gnu.org/bugzilla/> 2010-12-12 23:55 ` hubicka at gcc dot gnu.org 2010-12-13 0:58 ` hubicka at gcc dot gnu.org 2010-12-13 1:00 ` hubicka at gcc dot gnu.org 2010-12-13 1:07 ` hubicka at gcc dot gnu.org 2010-12-13 1:17 ` hubicka at gcc dot gnu.org 2010-12-13 1:47 ` hubicka at gcc dot gnu.org 2010-12-13 13:22 ` hubicka at gcc dot gnu.org 2010-12-17 0:08 ` hubicka at gcc dot gnu.org 2015-03-07 0:22 ` hubicka at gcc dot gnu.org 2015-03-09 9:36 ` rguenth at gcc dot gnu.org 2015-03-09 11:33 ` rguenth at gcc dot gnu.org 2015-03-09 15:26 ` rguenth at gcc dot gnu.org 2015-03-09 15:36 ` rguenth at gcc dot gnu.org [this message] 2015-03-10 4:55 ` hubicka at ucw dot cz 2015-03-10 8:26 ` rguenth at gcc dot gnu.org 2015-03-10 8:35 ` rguenther at suse dot de 2015-03-10 11:54 ` rguenth at gcc dot gnu.org 2015-03-10 12:03 ` rguenth at gcc dot gnu.org 2015-03-10 12:31 ` jakub at gcc dot gnu.org 2015-03-10 12:44 ` rguenth at gcc dot gnu.org 2015-03-10 12:51 ` rguenther at suse dot de 2015-03-10 13:19 ` rguenth at gcc dot gnu.org 2015-03-12 15:09 ` rguenth at gcc dot gnu.org 2015-03-13 8:43 ` [Bug ipa/44563] " rguenth at gcc dot gnu.org 2015-03-13 8:47 ` rguenth at gcc dot gnu.org 2015-03-13 8:53 ` rguenth at gcc dot gnu.org 2015-03-13 8:55 ` rguenth at gcc dot gnu.org 2015-03-16 0:07 ` hubicka at ucw dot cz 2024-02-16 13:52 ` rguenth at gcc dot gnu.org 2010-06-17 4:10 [Bug c/44563] New: " jvoss at altsci dot com 2010-06-17 4:14 ` [Bug tree-optimization/44563] " jvoss at altsci dot com 2010-06-17 10:37 ` rguenth at gcc dot gnu dot org 2010-06-17 10:45 ` rguenth at gcc dot gnu dot org 2010-06-17 12:28 ` jakub at gcc dot gnu dot org
Reply instructions: You may reply publicly to this message via plain-text email using any one of the following methods: * Save the following mbox file, import it into your mail client, and reply-to-all from there: mbox Avoid top-posting and favor interleaved quoting: https://en.wikipedia.org/wiki/Posting_style#Interleaved_style * Reply using the --to, --cc, and --in-reply-to switches of git-send-email(1): git send-email \ --in-reply-to=bug-44563-4-tkz3CnQoqp@http.gcc.gnu.org/bugzilla/ \ --to=gcc-bugzilla@gcc.gnu.org \ --cc=gcc-bugs@gcc.gnu.org \ /path/to/YOUR_REPLY https://kernel.org/pub/software/scm/git/docs/git-send-email.html * If your mail client supports setting the In-Reply-To header via mailto: links, try the mailto: linkBe sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).