From: Richard Biener <rguenther@suse.de>
To: Jan Hubicka <hubicka@ucw.cz>
Cc: gcc-patches@gcc.gnu.org, d@dcepelik.cz
Subject: Re: Make nonoverlapping_component_refs work with duplicated main variants
Date: Tue, 09 Jul 2019 12:21:00 -0000 [thread overview]
Message-ID: <alpine.LSU.2.20.1907091402510.2976@zhemvz.fhfr.qr> (raw)
In-Reply-To: <20190709114917.qva4nb2h7j5vzdur@kam.mff.cuni.cz>
[-- Attachment #1: Type: text/plain, Size: 16532 bytes --]
On Tue, 9 Jul 2019, Jan Hubicka wrote:
> Hi,
> this is updated patch. I based the logic on gimple_compare_field_offset but
> dropped bits about PLACEHOLDER_EXPR since my understanding is that to get
> comparsion done I would then need to compare operand #2 of COMPONENT_REF which
> I don't.
>
> I also wrote the range checks using polyint since I believe that most code is
> supposed to be updated this way (shall we update gimple_compare_field_offset?
For consistency yes I guess but IIRC they cannot really appear in
FIELD_DECLs.
> It is used in canonical type merging and considering fields different may lead
> to wrong code I would say, but I do not know enough about SVE to construct
> testcase).
>
> I updated documentation of return values which I also find somewhat confusing
> since 1/0 meanings in nonoverlapping_* is reversed compared to
> aliasing_component_refs. Main entry is called refs_may_alias so I am
> considering rename all the functions into *_may_alias and make them return 0
> for disambiguation, 1 for non-disambiguation and use -1's to keep decidion
> whether to work harder.
>
> However for now I am sticking with the nonoverlapping reversed semantics.
>
>
> There are no changes in tramp3d stats (there are no unmerged types)
> Here are stats on cc1plus build:
>
> Alias oracle query stats:
> refs_may_alias_p: 43435216 disambiguations, 51989307 queries
> ref_maybe_used_by_call_p: 60843 disambiguations, 44040532 queries
> call_may_clobber_ref_p: 6051 disambiguations, 9115 queries
> nonoverlapping_component_refs_p: 0 disambiguations, 2535 queries
> nonoverlapping_component_refs_since_match_p: 12297 disambiguations, 40458 must overlaps, 53243 queries
> aliasing_component_refs_p: 70892 disambiguations, 374789 queries
> TBAA oracle: 12680127 disambiguations 32179405 queries
> 10383370 are in alias set 0
> 5747729 queries asked about the same object
> 148 queries asked about the same alias set
> 0 access volatile
> 2482808 are dependent in the DAG
> 885223 are aritificially in conflict with void *
>
> PTA query stats:
> pt_solution_includes: 562382 disambiguations, 7467931 queries
> pt_solutions_intersect: 412873 disambiguations, 7818955 queries
>
> It seems that nonoverlapping_component_refs_since_match_p is pretty good on
> making decision and there are only few cases where it return -1 which is good.
> So i guess teaching it about ARRAY_REFs is logical next step.
>
> If I would like to benchmark alias oracle compile time, what is
> reasonable way to do that?
>
> Bootstrapped/regtested x86_64-linux, OK?
>
> Honza
>
> * tree-ssa-alias.c (nonoverlapping_component_refs_p_1): Break out
> from ...; work also on duplicated types.
> (nonoverlapping_component_refs_since_match): ... here
> (ncr_type_uid): Break out from ...
> (ncr_compar): ... here; look for TYPE_UID of canonical type if
> available.
> (nonoverlapping_component_refs_p): Use same_type_for_tbaa to match
> the types and nonoverlapping_component_refs_p_1 to disambiguate.
> * g++.dg/lto/alias-3_0.C: New file.
> * g++.dg/lto/alias-3_1.c: New file.
>
> Index: tree-ssa-alias.c
> ===================================================================
> --- tree-ssa-alias.c (revision 273193)
> +++ tree-ssa-alias.c (working copy)
> @@ -1128,6 +1128,90 @@ aliasing_component_refs_p (tree ref1,
> return false;
> }
>
> +/* FIELD1 and FIELD2 are two component refs whose bases either do
> + not overlap at all or their addresses are the same.
> +
> + Return 1 if FIELD1 and FIELD2 are non-overlapping
> +
> + Return 0 if FIELD1 and FIELD2 are posisbly overlapping but in case of
> + overlap their addresses are the same.
> +
> + Return -1 otherwise.
> +
> + Main difference between 0 and -1 is to let
> + nonoverlapping_component_refs_since_match_p discover the semnatically
> + equivalent part of the access path. */
> +
> +static int
> +nonoverlapping_component_refs_p_1 (const_tree field1, const_tree field2)
> +{
> + /* If both fields are of the same type, we could save hard work of
> + comparing offsets.
> + ??? We cannot simply use the type of operand #0 of the refs here
> + as the Fortran compiler smuggles type punning into COMPONENT_REFs
> + for common blocks instead of using unions like everyone else. */
> + tree type1 = DECL_CONTEXT (field1);
> + tree type2 = DECL_CONTEXT (field2);
> +
drop the vertical space
> + if (type1 == type2 && TREE_CODE (type1) == RECORD_TYPE)
> + {
> + if (field1 == field2)
> + return 0;
> + /* A field and its representative need to be considered the
> + same. */
> + if (DECL_BIT_FIELD_REPRESENTATIVE (field1) == field2
> + || DECL_BIT_FIELD_REPRESENTATIVE (field2) == field1)
> + return 0;
> + /* Different fields of the same record type cannot overlap.
> + ??? Bitfields can overlap at RTL level so punt on them. */
> + if (DECL_BIT_FIELD (field1) && DECL_BIT_FIELD (field2))
> + return 0;
> + /* Assume that different FIELD_DECLs never overlap in a RECORD_TYPE. */
> + return 1;
> + }
> + else
> + {
drop the else {
> + /* Different fields of the same record type cannot overlap.
> + ??? Bitfields can overlap at RTL level so punt on them. */
> + if (DECL_BIT_FIELD (field1) && DECL_BIT_FIELD (field2))
> + return 0;
> +
don't you need the DECL_BIT_FIELD_REPRESENTATIVE check here as well?
I'd do
if (DECL_BIT_FIELD_REPRESENTATIVE (field1))
field1 = DECL_BIT_FIELD_REPRESENTATIVE (field1);
if (DECL_BIT_FIELD_REPRESENTATIVE (field2))
field2 = DECL_BIT_FIELD_REPRESENTATIVE (field2);
thus use the representative for the overlap check. It might
be the case that we can improve here and if we do this
can do the DECL_BIT_FIELD check after this (hoping the
representative doesn't have it set).
> + if (tree_int_cst_equal (DECL_FIELD_OFFSET (field1),
> + DECL_FIELD_OFFSET (field2))
> + && tree_int_cst_equal (DECL_FIELD_BIT_OFFSET (field1),
> + DECL_FIELD_BIT_OFFSET (field2)))
> + return 0;
In gimple_compare_field_offset this was fast-pathed for
DECL_OFFSET_ALIGN (f1) == DECL_OFFSET_ALIGN (f2) so I suggest to
do that here as well. Note that DECL_FIELD_OFFSET can be
a non-constant which means you cannot use tree_int_cst_equal
unconditionally here but you have to use operand_equal_p.
> + /* Note that it may be possible to use component_ref_field_offset
> + which would provide offsets as trees. However constructing and folding
> + trees is expensive and does not seem to be worth the compile time
> + cost. */
> +
> + poly_uint64 offset1, offset2;
> + poly_uint64 bit_offset1, bit_offset2;
> + poly_uint64 size1, size2;
I think you need poly_offset_int here since you convert to bits below.
The gimple_compare_field_offset checking way looks cheaper btw, so
I wonder why you don't simply call it but replicate things here?
When do we expect to have partially overlapping field decls? Even
when considering canonical type merging?
> + if (poly_int_tree_p (DECL_FIELD_OFFSET (field1), &offset1)
> + && poly_int_tree_p (DECL_FIELD_OFFSET (field2), &offset2)
> + && poly_int_tree_p (DECL_FIELD_BIT_OFFSET (field1), &bit_offset1)
> + && poly_int_tree_p (DECL_FIELD_BIT_OFFSET (field2), &bit_offset2)
> + && poly_int_tree_p (DECL_SIZE (field1), &size1)
> + && poly_int_tree_p (DECL_SIZE (field2), &size2))
> + {
> + offset1 = (offset1 << LOG2_BITS_PER_UNIT) + bit_offset1;
> + offset2 = (offset2 << LOG2_BITS_PER_UNIT) + bit_offset2;
> +
> + if (known_eq (offset1, offset2))
> + return 0;
> + if (!ranges_maybe_overlap_p (offset1, size1, offset2, size2))
> + return 1;
> + }
> + }
> + /* Resort to slower overlap checking by looking for matching types in
> + the middle of access path. */
> + return -1;
> +}
> +
> /* Try to disambiguate REF1 and REF2 under the assumption that MATCH1 and
> MATCH2 either point to the same address or are disjoint.
> MATCH1 and MATCH2 are assumed to be ref in the access path of REF1 and REF2
> @@ -1224,6 +1308,7 @@ nonoverlapping_component_refs_since_matc
> case the return value will precisely be false. */
> while (true)
> {
> + bool seen_noncomponent_ref_p = false;
> do
> {
> if (component_refs1.is_empty ())
> @@ -1233,6 +1318,8 @@ nonoverlapping_component_refs_since_matc
> return 0;
> }
> ref1 = component_refs1.pop ();
> + if (TREE_CODE (ref1) != COMPONENT_REF)
> + seen_noncomponent_ref_p = true;
> }
> while (!RECORD_OR_UNION_TYPE_P (TREE_TYPE (TREE_OPERAND (ref1, 0))));
>
> @@ -1245,17 +1332,15 @@ nonoverlapping_component_refs_since_matc
> return 0;
> }
> ref2 = component_refs2.pop ();
> + if (TREE_CODE (ref2) != COMPONENT_REF)
> + seen_noncomponent_ref_p = true;
> }
> while (!RECORD_OR_UNION_TYPE_P (TREE_TYPE (TREE_OPERAND (ref2, 0))));
>
> - /* Beware of BIT_FIELD_REF. */
> - if (TREE_CODE (ref1) != COMPONENT_REF
> - || TREE_CODE (ref2) != COMPONENT_REF)
> - {
> - ++alias_stats
> - .nonoverlapping_component_refs_since_match_p_may_alias;
> - return -1;
> - }
> + /* BIT_FIELD_REF and VIEW_CONVERT_EXPR are taken off the vectors
> + earlier. */
> + gcc_checking_assert (TREE_CODE (ref1) == COMPONENT_REF
> + && TREE_CODE (ref2) == COMPONENT_REF);
>
> tree field1 = TREE_OPERAND (ref1, 1);
> tree field2 = TREE_OPERAND (ref2, 1);
> @@ -1266,33 +1351,27 @@ nonoverlapping_component_refs_since_matc
> tree type1 = DECL_CONTEXT (field1);
> tree type2 = DECL_CONTEXT (field2);
>
> - /* We cannot disambiguate fields in a union or qualified union. */
> - if (type1 != type2 || TREE_CODE (type1) != RECORD_TYPE)
> + /* If we skipped array refs on type of different sizes, we can
> + no longer be sure that there are not partial overlaps. */
> + if (seen_noncomponent_ref_p
> + && !operand_equal_p (TYPE_SIZE (type1), TYPE_SIZE (type2), 0))
> {
> - ++alias_stats.nonoverlapping_component_refs_since_match_p_may_alias;
> + ++alias_stats
> + .nonoverlapping_component_refs_since_match_p_may_alias;
> return -1;
> }
>
> - if (field1 != field2)
> + int cmp = nonoverlapping_component_refs_p_1 (field1, field2);
> + if (cmp == -1)
> {
> - /* A field and its representative need to be considered the
> - same. */
> - if (DECL_BIT_FIELD_REPRESENTATIVE (field1) == field2
> - || DECL_BIT_FIELD_REPRESENTATIVE (field2) == field1)
> - {
> - ++alias_stats
> - .nonoverlapping_component_refs_since_match_p_must_overlap;
> - return 0;
> - }
> - /* Different fields of the same record type cannot overlap.
> - ??? Bitfields can overlap at RTL level so punt on them. */
> - if (DECL_BIT_FIELD (field1) && DECL_BIT_FIELD (field2))
> - {
> - ++alias_stats
> - .nonoverlapping_component_refs_since_match_p_must_overlap;
> - return 0;
> - }
> - ++alias_stats.nonoverlapping_component_refs_since_match_p_no_alias;
> + ++alias_stats
> + .nonoverlapping_component_refs_since_match_p_may_alias;
> + return -1;
> + }
> + else if (cmp == 1)
> + {
> + ++alias_stats
> + .nonoverlapping_component_refs_since_match_p_no_alias;
> return 1;
> }
> }
> @@ -1301,6 +1380,24 @@ nonoverlapping_component_refs_since_matc
> return 0;
> }
>
> +/* Return TYPE_UID which can be used to match record types we consider
> + same for TBAA purposes. */
> +
> +static inline int
> +ncr_type_uid (const_tree field)
> +{
> + /* ??? We cannot simply use the type of operand #0 of the refs here
> + as the Fortran compiler smuggles type punning into COMPONENT_REFs
> + for common blocks instead of using unions like everyone else. */
> + tree type = DECL_FIELD_CONTEXT (field);
> + /* With LTO types considered same_type_for_tbaa_p
> + from different translation unit may not have same
> + main variant. They however have same TYPE_CANONICAL. */
> + if (TYPE_CANONICAL (type))
> + return TYPE_UID (TYPE_CANONICAL (type));
> + return TYPE_UID (type);
> +}
> +
> /* qsort compare function to sort FIELD_DECLs after their
> DECL_FIELD_CONTEXT TYPE_UID. */
>
> @@ -1309,8 +1406,9 @@ ncr_compar (const void *field1_, const v
> {
> const_tree field1 = *(const_tree *) const_cast <void *>(field1_);
> const_tree field2 = *(const_tree *) const_cast <void *>(field2_);
> - unsigned int uid1 = TYPE_UID (DECL_FIELD_CONTEXT (field1));
> - unsigned int uid2 = TYPE_UID (DECL_FIELD_CONTEXT (field2));
> + unsigned int uid1 = ncr_type_uid (field1);
> + unsigned int uid2 = ncr_type_uid (field2);
> +
> if (uid1 < uid2)
> return -1;
> else if (uid1 > uid2)
> @@ -1377,10 +1475,9 @@ nonoverlapping_component_refs_p (const_t
> if (fieldsx.length () == 1
> && fieldsy.length () == 1)
> {
> - if ((DECL_FIELD_CONTEXT (fieldsx[0])
> - == DECL_FIELD_CONTEXT (fieldsy[0]))
> - && fieldsx[0] != fieldsy[0]
> - && !(DECL_BIT_FIELD (fieldsx[0]) && DECL_BIT_FIELD (fieldsy[0])))
> + if (same_type_for_tbaa (DECL_FIELD_CONTEXT (fieldsx[0]),
> + DECL_FIELD_CONTEXT (fieldsy[0])) == 1
> + && nonoverlapping_component_refs_p_1 (fieldsx[0], fieldsy[0]) == 1)
> {
> ++alias_stats.nonoverlapping_component_refs_p_no_alias;
> return true;
> @@ -1413,31 +1510,18 @@ nonoverlapping_component_refs_p (const_t
> {
> const_tree fieldx = fieldsx[i];
> const_tree fieldy = fieldsy[j];
> - tree typex = DECL_FIELD_CONTEXT (fieldx);
> - tree typey = DECL_FIELD_CONTEXT (fieldy);
> - if (typex == typey)
> - {
> - /* We're left with accessing different fields of a structure,
> - no possible overlap. */
> - if (fieldx != fieldy)
> - {
> - /* A field and its representative need to be considered the
> - same. */
> - if (DECL_BIT_FIELD_REPRESENTATIVE (fieldx) == fieldy
> - || DECL_BIT_FIELD_REPRESENTATIVE (fieldy) == fieldx)
> - ;
> - /* Different fields of the same record type cannot overlap.
> - ??? Bitfields can overlap at RTL level so punt on them. */
> - else if (DECL_BIT_FIELD (fieldx) && DECL_BIT_FIELD (fieldy))
> - ;
> - else
> - {
> - ++alias_stats.nonoverlapping_component_refs_p_no_alias;
> - return true;
> - }
> - }
> +
> + /* We're left with accessing different fields of a structure,
> + no possible overlap. */
> + if (same_type_for_tbaa (DECL_FIELD_CONTEXT (fieldx),
> + DECL_FIELD_CONTEXT (fieldy)) == 1
> + && nonoverlapping_component_refs_p_1 (fieldx, fieldy) == 1)
> + {
> + ++alias_stats.nonoverlapping_component_refs_p_no_alias;
> + return true;
> }
> - if (TYPE_UID (typex) < TYPE_UID (typey))
> +
> + if (ncr_type_uid (fieldx) < ncr_type_uid (fieldy))
> {
> i++;
> if (i == fieldsx.length ())
> Index: testsuite/g++.dg/lto/alias-3_0.C
> ===================================================================
> --- testsuite/g++.dg/lto/alias-3_0.C (nonexistent)
> +++ testsuite/g++.dg/lto/alias-3_0.C (working copy)
> @@ -0,0 +1,27 @@
> +/* { dg-lto-do run } */
> +/* { dg-lto-options { { -O3 -flto -fno-early-inlining } } } */
> +
> +struct a
> +{
> + int foo,bar;
> +};
> +struct b
> +{
> + struct a a[10];
> +};
> +
> +__attribute__ ((used)) struct b b, *bptr=&b, *bptr2=&b;
> +__attribute__ ((used)) int i,j;
> +
> +extern "C" void inline_me_late (void);
> +
> +int
> +main (void)
> +{
> + int jj=j;
> + bptr2->a[jj].bar = 0;
> + inline_me_late ();
> + if (!__builtin_constant_p (bptr2->a[jj].bar == 0))
> + __builtin_abort ();
> + return 0;
> +}
> Index: testsuite/g++.dg/lto/alias-3_1.c
> ===================================================================
> --- testsuite/g++.dg/lto/alias-3_1.c (nonexistent)
> +++ testsuite/g++.dg/lto/alias-3_1.c (working copy)
> @@ -0,0 +1,20 @@
> +/* { dg-lto-do run } */
> +/* { dg-lto-options { { -O3 -flto -fno-early-inlining } } } */
> +struct a
> +{
> + int foo,bar;
> +};
> +struct b
> +{
> + struct a a[10];
> +};
> +
> +extern struct b *bptr;
> +extern int i;
> +
> +void
> +inline_me_late (void)
> +{
> + bptr->a[i].foo=1;
> +}
> +
>
--
Richard Biener <rguenther@suse.de>
SUSE Linux GmbH, Maxfeldstrasse 5, 90409 Nuernberg, Germany;
GF: Felix Imendörffer, Mary Higgins, Sri Rasiah; HRB 21284 (AG NÌrnberg)
next prev parent reply other threads:[~2019-07-09 12:14 UTC|newest]
Thread overview: 15+ messages / expand[flat|nested] mbox.gz Atom feed top
2019-07-08 7:39 Jan Hubicka
2019-07-08 9:10 ` Richard Biener
2019-07-08 10:48 ` Jan Hubicka
2019-07-09 12:02 ` Jan Hubicka
2019-07-09 12:21 ` Richard Biener [this message]
2019-07-09 12:41 ` Jan Hubicka
2019-07-09 12:52 ` Richard Biener
2019-07-09 13:10 ` Jan Hubicka
2019-07-09 13:30 ` Richard Biener
2019-07-09 13:37 ` Jan Hubicka
2019-07-09 13:41 ` Richard Biener
2019-07-09 21:03 ` Bernhard Reutner-Fischer
2019-07-11 8:29 ` Rainer Orth
2019-07-16 9:30 ` Jan Hubicka
2019-07-16 11:58 ` Rainer Orth
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=alpine.LSU.2.20.1907091402510.2976@zhemvz.fhfr.qr \
--to=rguenther@suse.de \
--cc=d@dcepelik.cz \
--cc=gcc-patches@gcc.gnu.org \
--cc=hubicka@ucw.cz \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).