From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path: <amacleod@redhat.com>
Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.133.124])
	by sourceware.org (Postfix) with ESMTPS id 35B353858CDA
	for <gcc-patches@gcc.gnu.org>; Tue,  4 Oct 2022 15:42:36 +0000 (GMT)
DMARC-Filter: OpenDMARC Filter v1.4.1 sourceware.org 35B353858CDA
Authentication-Results: sourceware.org; dmarc=pass (p=none dis=none) header.from=redhat.com
Authentication-Results: sourceware.org; spf=pass smtp.mailfrom=redhat.com
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com;
	s=mimecast20190719; t=1664898155;
	h=from:from:reply-to:subject:subject:date:date:message-id:message-id:
	 to:to:cc:cc:mime-version:mime-version:content-type:content-type:
	 content-transfer-encoding:content-transfer-encoding:
	 in-reply-to:in-reply-to:references:references;
	bh=tR1Tw2LBPvXdn+C9ICKryJcKrYh8fIefrSFSoz6NjMY=;
	b=gReq27pDvrv02VlUz8U2ytuSAiFvp3LS+gWZZTrIm64AY3t1yAkyHpoUZiBNjZqg0BBAPH
	mQJw88hZWbtWPAjefPRHHnWdhEqmCQmCyJuIlvuVVsZOqtjRyvqHw0NnMXm5GaalBKdR9P
	erqgEfIVYhdbupjBY3/q7dlUsDGiuOM=
Received: from mail-qv1-f72.google.com (mail-qv1-f72.google.com
 [209.85.219.72]) by relay.mimecast.com with ESMTP with STARTTLS
 (version=TLSv1.3, cipher=TLS_AES_128_GCM_SHA256) id
 us-mta-252-AnocV9GRM5OUildOK68pOw-1; Tue, 04 Oct 2022 11:42:34 -0400
X-MC-Unique: AnocV9GRM5OUildOK68pOw-1
Received: by mail-qv1-f72.google.com with SMTP id mo5-20020a056214330500b004ad711537a6so9081262qvb.10
        for <gcc-patches@gcc.gnu.org>; Tue, 04 Oct 2022 08:42:34 -0700 (PDT)
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
        d=1e100.net; s=20210112;
        h=content-transfer-encoding:in-reply-to:from:references:cc:to
         :content-language:subject:user-agent:mime-version:date:message-id
         :x-gm-message-state:from:to:cc:subject:date;
        bh=tR1Tw2LBPvXdn+C9ICKryJcKrYh8fIefrSFSoz6NjMY=;
        b=kSWqZ2cx5P1St7izY2mHpHu1VHlAckv99eG+QYZiAF8TzZipOPFBvTFOOUtKCnOh/v
         UePHxIf8s3b1KvMEQnfs3zIcGkNaAOc9225rkgLiTLK2RZxh0BuAJBrY9bfQJ1QfW1ym
         Ay1JIVd57WOf6gUuzjKyEgNTL5yjIarRvb7rDTS/TDi6m/AHkbMfwwjclHANIumtdlRX
         WoJcP5kIPovqYvRO4CONG/DA5Y5eBcbAY5TInuwOxQYp8g1hSpwKe+rTeEDBcNxT75qz
         Y1OmAP7VDkAJt2ZvlXcF5FqHjNEu4aLKepQ90wC5R7pGaO4KGSKaNCxV6LIDB9js8ydl
         PKJg==
X-Gm-Message-State: ACrzQf1/ejhnH4PO/UHtRfOCAF2t8aKVwBRnTJxTlcEXR+bDrag5Wz2R
	Y8Lsr0erXtwcvCmi3OmKlGjkQoPF26kMCa9OPIc4hKSctAgIeholIzsf9t8XaNCcP55N3qecJbf
	kjfpwI5qFF2FhULXMyg==
X-Received: by 2002:a05:620a:1a17:b0:6ce:7c1b:c27f with SMTP id bk23-20020a05620a1a1700b006ce7c1bc27fmr17412031qkb.42.1664898154275;
        Tue, 04 Oct 2022 08:42:34 -0700 (PDT)
X-Google-Smtp-Source: AMsMyM6GfDSckzDPMo7TXxWM0AfoSsKALA0X+kMc1akuHaqcReHZt61ukZmMpC0Mk7dnRxjSDsV79A==
X-Received: by 2002:a05:620a:1a17:b0:6ce:7c1b:c27f with SMTP id bk23-20020a05620a1a1700b006ce7c1bc27fmr17412013qkb.42.1664898153992;
        Tue, 04 Oct 2022 08:42:33 -0700 (PDT)
Received: from ?IPV6:2607:fea8:a263:f600::3dbe? ([2607:fea8:a263:f600::3dbe])
        by smtp.gmail.com with ESMTPSA id cb24-20020a05622a1f9800b0034355a352d1sm12164725qtb.92.2022.10.04.08.42.33
        (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128);
        Tue, 04 Oct 2022 08:42:33 -0700 (PDT)
Message-ID: <974d3399-7eac-803d-2c64-fb7d7bf3f71f@redhat.com>
Date: Tue, 4 Oct 2022 11:42:32 -0400
MIME-Version: 1.0
User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:102.0) Gecko/20100101
 Thunderbird/102.2.1
Subject: Re: [COMMITTED] Convert nonzero mask in irange to wide_int.
To: Aldy Hernandez <aldyh@redhat.com>,
 Richard Biener <richard.guenther@gmail.com>
Cc: GCC patches <gcc-patches@gcc.gnu.org>
References: <CAGm3qMU1u_-_4Fy1wWk59FT+LbvsE1vFKdKq8pQ5ePv9uXZzag@mail.gmail.com>
 <07FCA378-7E86-4E06-B506-FED0C60CE31C@gmail.com>
 <CAGm3qMXQTs-B8mTBbRQYUn3ykRw1EP5TygjYQF_wfLti6_P4aw@mail.gmail.com>
From: Andrew MacLeod <amacleod@redhat.com>
In-Reply-To: <CAGm3qMXQTs-B8mTBbRQYUn3ykRw1EP5TygjYQF_wfLti6_P4aw@mail.gmail.com>
X-Mimecast-Spam-Score: 0
X-Mimecast-Originator: redhat.com
Content-Language: en-US
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 8bit
X-Spam-Status: No, score=-7.6 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH,DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,NICE_REPLY_A,RCVD_IN_DNSWL_NONE,SPF_HELO_NONE,SPF_NONE,TXREP autolearn=ham autolearn_force=no version=3.4.6
X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on server2.sourceware.org
List-Id: <gcc-patches.gcc.gnu.org>


On 10/4/22 11:14, Aldy Hernandez wrote:
> On Tue, Oct 4, 2022 at 4:34 PM Richard Biener
> <richard.guenther@gmail.com> wrote:
>>
>>
>>> Am 04.10.2022 um 16:30 schrieb Aldy Hernandez <aldyh@redhat.com>:
>>>
>>> ﻿On Tue, Oct 4, 2022 at 3:27 PM Andrew MacLeod <amacleod@redhat.com> wrote:
>>>>
>>>>> On 10/4/22 08:13, Aldy Hernandez via Gcc-patches wrote:
>>>>>> On Tue, Oct 4, 2022, 13:28 Aldy Hernandez <aldyh@redhat.com> wrote:
>>>>>> On Tue, Oct 4, 2022 at 9:55 AM Richard Biener
>>>>>> <richard.guenther@gmail.com> wrote:
>>>>>>> Am 04.10.2022 um 09:36 schrieb Aldy Hernandez via Gcc-patches <
>>>>>> gcc-patches@gcc.gnu.org>:
>>>>>>>> ﻿The reason the nonzero mask was kept in a tree was basically inertia,
>>>>>>>> as everything in irange is a tree.  However, there's no need to keep
>>>>>>>> it in a tree, as the conversions to and from wide ints are very
>>>>>>>> annoying.  That, plus special casing NULL masks to be -1 is prone
>>>>>>>> to error.
>>>>>>>>
>>>>>>>> I have not only rewritten all the uses to assume a wide int, but
>>>>>>>> have corrected a few places where we weren't propagating the masks, or
>>>>>>>> rather pessimizing them to -1.  This will become more important in
>>>>>>>> upcoming patches where we make better use of the masks.
>>>>>>>>
>>>>>>>> Performance testing shows a trivial improvement in VRP, as things like
>>>>>>>> irange::contains_p() are tied to a tree.  Ughh, can't wait for trees in
>>>>>>>> iranges to go away.
>>>>>>> You want trailing wide int storage though.  A wide_int is quite large.
>>>>>> Absolutely, this is only for short term storage.  Any time we need
>>>>>> long term storage, say global ranges in SSA_NAME_RANGE_INFO, we go
>>>>>> through vrange_storage which will stream things in a more memory
>>>>>> efficient manner.  For irange, vrange_storage will stream all the
>>>>>> sub-ranges, including the nonzero bitmask which is the first entry in
>>>>>> such storage, as trailing_wide_ints.
>>>>>>
>>>>>> See irange_storage_slot to see how it lives in GC memory.
>>>>>>
>>>>> That being said, the ranger's internal cache uses iranges, albeit with a
>>>>> squished down number of subranges (the minimum amount to represent the
>>>>> range).  So each cache entry will now be bigger by the difference between
>>>>> one tree and one wide int.
>>>>>
>>>>> I wonder if we should change the cache to use vrange_storage. If not now,
>>>>> then when we convert all the subranges to wide ints.
>>>>>
>>>>> Of course, the memory pressure of the cache is not nearly as problematic as
>>>>> SSA_NAME_RANGE_INFO. The cache only stores names it cares about.
>>>> Rangers cache can be a memory bottleneck in pathological cases..
>>>> Certainly not as bad as it use to be, but I'm sure it can still be
>>>> problematic.    Its suppose to be a memory efficient representation
>>>> because of that.  The cache can have an entry for any live ssa-name
>>>> (which means all of them at some point in the IL) multiplied by a factor
>>>> involving the number of dominator blocks and outgoing edges ranges are
>>>> calculated on.   So while SSA_NAME_RANGE_INFO is a linear thing, the
>>>> cache lies somewhere between a logarithmic and exponential factor based
>>>> on the CFG size.
>>> Hmmm, perhaps the ultimate goal here should be to convert the cache to
>>> use vrange_storage, which uses trailing wide ints for all of the end
>>> points plus the masks (known_ones included for the next release).
>>>
>>>> if you are growing the common cases of 1 to 2 endpoints to more than
>>>> double in size (and most of the time not be needed), that would not be
>>>> very appealing :-P  If we have any wide-ints, they would need to be a
>>>> memory efficient version.   The Cache uses an irange_allocator, which is
>>>> suppose to provide a memory efficient objects.. hence why it trims the
>>>> number of ranges down to only what is needed.  It seems like a trailing
>>>> wide-Int might be in order based on that..
>>>>
>>>> Andrew
>>>>
>>>>
>>>> PS. which will be more problematic if you eventually introduce a
>>>> known_ones wide_int.    I thought the mask tracking was/could be
>>>> something simple like  HOST_WIDE_INT..  then you only tracks masks in
>>>> types up to the size of a HOST_WIDE_INT.  then storage and masking is
>>>> all trivial without going thru a wide_int.    Is that not so/possible?
>>> That's certainly easy and cheaper to do.  The hard part was fixing all
>>> the places where we weren't keeping the masks up to date, and that's
>>> done (sans any bugs ;-)).
>>>
>>> Can we get consensus here on only tracking masks for type sizes less
>>> than HOST_WIDE_INT?  I'd hate to do all the work only to realize we
>>> need to track 512 bit masks on a 32-bit host cross :-).
>> 64bits are not enough, 128 might be.  But there’s trailing wide int storage so I don’t see the point in restricting ourselves?
> Fair enough.  Perhaps we should bite the bullet and convert the cache
> to vrange_storage which is all set up for streaming irange's with
> trailing_wide_ints.  No changes should be necessary for irange, since
> we never have more than 3-4 live at any one time.  It's the cache that
> needs twiddling.
>
Wouldnt it be irange_allocator that needs twiddling?  It purpose in life 
is to allocate iranges for memory storage...  the cache is just a 
client, as is rangers global cache, etc...  that was the intention of 
irange_allocator to isolate clients from having to worry about memory 
storage issues?

Or is that problematic?


Andrew