public inbox for gcc-bugs@sourceware.org help / color / mirror / Atom feed
From: "rguenther at suse dot de" <gcc-bugzilla@gcc.gnu.org> To: gcc-bugs@gcc.gnu.org Subject: [Bug tree-optimization/113787] [12/13/14 Regression] Wrong code at -O with ipa-modref on aarch64 Date: Wed, 14 Feb 2024 08:19:31 +0000 [thread overview] Message-ID: <bug-113787-4-ytpMsPnoF4@http.gcc.gnu.org/bugzilla/> (raw) In-Reply-To: <bug-113787-4@http.gcc.gnu.org/bugzilla/> https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113787 --- Comment #16 from rguenther at suse dot de <rguenther at suse dot de> --- On Tue, 13 Feb 2024, hubicka at ucw dot cz wrote: > https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113787 > > --- Comment #15 from Jan Hubicka <hubicka at ucw dot cz> --- > > > > IVOPTs does the above but it does it (or should) as > > > > offset = (uintptr)&base2 - (uintptr)&base1; > > val = *((T *)((uintptr)base1 + i + offset)) > > > > which is OK for points-to as no POINTER_PLUS_EXPR is involved so the > > resulting pointer points to both base1 and base2 (which isn't optimal > > but correct). > > > > If we somehow get back a POINTER_PLUS that's where things go wrong. > > > > Doing the above in C code would be valid input so we have to treat > > it correctly (OK, the standard only allows back-and-forth > > pointer-to-integer casts w/o any adjustment, but of course we relax > > this). > > OK. Modrefs tracks base pointer for accesses and tries to prove that > they are function parameters. This should immitate ivopts: > void > __attribute__ ((noinline)) > set(int *a, unsigned long off) > { > *(int *)((unsigned long)a + off) = 1; > } > int > test () > { > int a; > int b = 0; > set (&a, (unsigned long)&b - (unsigned long)&a); > return b; > } > > Here set gets following gimple at modref2 time: > __attribute__((noinline)) > void set (int * a, long unsigned int off) > { > long unsigned int a.0_1; > long unsigned int _2; > int * _3; > > <bb 2> [local count: 1073741824]: > a.0_1 = (long unsigned int) a_4(D); > _2 = a.0_1 + off_5(D); > _3 = (int *) _2; > *_3 = 1; > return; > > } > > This is not pattern matched so modref does not think the access has a as > a base: > > stores: > Base 0: alias set 1 > Ref 0: alias set 1 > Every access > > While for: > > void > __attribute__ ((noinline)) > set(int *a, unsigned long off) > { > *(a+off/sizeof(int))=1; > } > > we produce: > > __attribute__((noinline)) > void set (int * a, long unsigned int off) > { > sizetype _1; > int * _2; > > <bb 2> [local count: 1073741824]: > _1 = off_3(D) & 18446744073709551612; > _2 = a_4(D) + _1; > *_2 = 1; > return; > } > > And this is understood: > > stores: > Base 0: alias set 1 > Ref 0: alias set 1 > access: Parm 0 > > If we consider it correct to optimize out the conversion from and to > pointer type, then I suppose any addition of pointer and integer which > we do not see means that we need to give up on tracking base completely. > > I guess PTA gets around by tracking points-to set also for non-pointer > types and consequently it also gives up on any such addition. > > But what we really get from relaxing this? > > > > IVOPTs then in putting all of the stuff into 'offset' gets at > > trying a TARGET_MEM_REF based on a NULL base but that's invalid. > > We then resort to a LEA (ADDR_EXPR of TARGET_MEM_REF) to compute > > the address which gets us into some phishy argument that it's > > not valid to decompose ADDR_EXPR of TARGET_MEM_REF to > > POINTER_PLUS of the TARGET_MEM_REF base and the offset. But > > that's how it is (points-to treats (address of) TARGET_MEM_REF > > as pointing to anything ...). > > > > > A quick fix would be to run IPA modref before ivopts, but I do not see how such > > > transformation can work with rest of alias analysis (PTA etc) > > > > It does. Somewhere IPA modref interprets things wrongly, I didn't figure > > out here though. > > > I guess PTA gets around by tracking points-to set also for non-pointer > types and consequently it also gives up on any such addition. It does. But note it does _not_ for POINTER_PLUS where it treats the offset operand as non-pointer. > I think it is ipa-prop.c::unadjusted_ptr_and_unit_offset. It accepts > pointer_plus expression, but does not look through POINTER_PLUS. > We can restrict it further, but tracking base pointer is quite useful, > so it would be nice to not give up completely. It looks like that function might treat that ADDR_EXPR <TARGET_MEM_REF <0B, ...>> as integer_zerop base. It does if (TREE_CODE (op) == ADDR_EXPR) { poly_int64 extra_offset = 0; tree base = get_addr_base_and_unit_offset (TREE_OPERAND (op, 0), &offset); if (!base) { base = get_base_address (TREE_OPERAND (op, 0)); if (TREE_CODE (base) != MEM_REF) break; offset_known = false; } else { if (TREE_CODE (base) != MEM_REF) break; with a variable offset we fall to the TREE_CODE (base) != MEM_REF and will have offset_known == true. Not sure what it does with the result though (it's not the address of a decl). This function seems to oddly special-case != MEM_REF ... (maybe it wants to hande DECL_P () as finishing? Note get_addr_base_and_unit_offset will return NULL for a TARGET_MEM_REF <&decl, ..., offset> but TARGET_MEM_REF itself if the base isn't an ADDR_EXPR, irrespective of whether the offset within it is constant or not. Not sure if the above is a problem, but it seems the only caller will just call points_to_local_or_readonly_memory_p on the ADDR_EXPR where refs_local_or_readonly_memory_p via points_to_local_or_readonly_memory_p will eventually do /* See if memory location is clearly invalid. */ if (integer_zerop (t)) return flag_delete_null_pointer_checks; and that might be a problem. As said, we rely on ADDR_EXPR <TARGET_MEM_REF <...> > to be an address computation that's not subject to strict interpretation to allow IVOPTs doing this kind of optimization w/o introducing some kind of INTEGER_LEA <...>. I know that's a bit awkward but we should make sure this is honored by IPA as well. I'd say diff --git a/gcc/ipa-fnsummary.cc b/gcc/ipa-fnsummary.cc index 74c9b4e1d1e..45a770cf940 100644 --- a/gcc/ipa-fnsummary.cc +++ b/gcc/ipa-fnsummary.cc @@ -2642,7 +2642,8 @@ points_to_local_or_readonly_memory_p (tree t) return true; return !ptr_deref_may_alias_global_p (t, false); } - if (TREE_CODE (t) == ADDR_EXPR) + if (TREE_CODE (t) == ADDR_EXPR + && TREE_CODE (TREE_OPERAND (t, 0)) != TARGET_MEM_REF) return refs_local_or_readonly_memory_p (TREE_OPERAND (t, 0)); return false; } might eventually work? Alternatively a bit less aggressive like the following. diff --git a/gcc/ipa-fnsummary.cc b/gcc/ipa-fnsummary.cc index 74c9b4e1d1e..7c79adf6440 100644 --- a/gcc/ipa-fnsummary.cc +++ b/gcc/ipa-fnsummary.cc @@ -2642,7 +2642,9 @@ points_to_local_or_readonly_memory_p (tree t) return true; return !ptr_deref_may_alias_global_p (t, false); } - if (TREE_CODE (t) == ADDR_EXPR) + if (TREE_CODE (t) == ADDR_EXPR + && (TREE_CODE (TREE_OPERAND (t, 0)) != TARGET_MEM_REF + || TREE_CODE (TREE_OPERAND (TREE_OPERAND (t, 0), 0)) != INTEGER_CST)) return refs_local_or_readonly_memory_p (TREE_OPERAND (t, 0)); return false; } A "nicer" solution might be to add a informational operand to TARGET_MEM_REF, representing the base pointer to be used for alias/points-to purposes. But if that's not invariant it might keep some otherwise unnecessary definition stmts live.
next prev parent reply other threads:[~2024-02-14 8:19 UTC|newest] Thread overview: 24+ messages / expand[flat|nested] mbox.gz Atom feed top 2024-02-06 13:40 [Bug tree-optimization/113787] New: [14 " acoplan at gcc dot gnu.org 2024-02-06 13:49 ` [Bug tree-optimization/113787] " jakub at gcc dot gnu.org 2024-02-06 13:56 ` pinskia at gcc dot gnu.org 2024-02-06 13:57 ` acoplan at gcc dot gnu.org 2024-02-06 14:07 ` acoplan at gcc dot gnu.org 2024-02-06 14:13 ` [Bug tree-optimization/113787] [12/13/14 " jakub at gcc dot gnu.org 2024-02-06 14:19 ` pinskia at gcc dot gnu.org 2024-02-06 14:23 ` acoplan at gcc dot gnu.org 2024-02-06 15:41 ` hubicka at gcc dot gnu.org 2024-02-06 16:18 ` pinskia at gcc dot gnu.org 2024-02-07 8:48 ` rguenth at gcc dot gnu.org 2024-02-07 8:49 ` rguenth at gcc dot gnu.org 2024-02-08 14:40 ` acoplan at gcc dot gnu.org 2024-02-13 9:03 ` hubicka at gcc dot gnu.org 2024-02-13 9:21 ` rguenther at suse dot de 2024-02-13 18:21 ` hubicka at ucw dot cz 2024-02-14 8:19 ` rguenther at suse dot de [this message] 2024-02-14 15:07 ` Jan Hubicka 2024-02-14 15:07 ` hubicka at ucw dot cz 2024-02-14 15:09 ` rguenther at suse dot de 2024-02-14 15:18 ` hubicka at ucw dot cz 2024-05-16 9:07 ` [Bug tree-optimization/113787] [12/13/14/15 " acoplan at gcc dot gnu.org 2024-05-16 13:34 ` cvs-commit at gcc dot gnu.org 2024-05-16 13:39 ` [Bug tree-optimization/113787] [12/13/14 " hubicka at gcc dot gnu.org
Reply instructions: You may reply publicly to this message via plain-text email using any one of the following methods: * Save the following mbox file, import it into your mail client, and reply-to-all from there: mbox Avoid top-posting and favor interleaved quoting: https://en.wikipedia.org/wiki/Posting_style#Interleaved_style * Reply using the --to, --cc, and --in-reply-to switches of git-send-email(1): git send-email \ --in-reply-to=bug-113787-4-ytpMsPnoF4@http.gcc.gnu.org/bugzilla/ \ --to=gcc-bugzilla@gcc.gnu.org \ --cc=gcc-bugs@gcc.gnu.org \ /path/to/YOUR_REPLY https://kernel.org/pub/software/scm/git/docs/git-send-email.html * If your mail client supports setting the In-Reply-To header via mailto: links, try the mailto: linkBe sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).