From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path: <simon.marchi@polymtl.ca>
Received: from smtp.polymtl.ca (smtp.polymtl.ca [132.207.4.11])
 by sourceware.org (Postfix) with ESMTPS id B42C63851C33
 for <gdb-patches@sourceware.org>; Mon,  7 Dec 2020 16:06:48 +0000 (GMT)
DMARC-Filter: OpenDMARC Filter v1.3.2 sourceware.org B42C63851C33
Received: from simark.ca (simark.ca [158.69.221.121]) (authenticated bits=0)
 by smtp.polymtl.ca (8.14.7/8.14.7) with ESMTP id 0B7G6e3R022798
 (version=TLSv1/SSLv3 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=NOT); 
 Mon, 7 Dec 2020 11:06:46 -0500
DKIM-Filter: OpenDKIM Filter v2.11.0 smtp.polymtl.ca 0B7G6e3R022798
Received: from [10.0.0.11] (173-246-6-90.qc.cable.ebox.net [173.246.6.90])
 (using TLSv1.3 with cipher TLS_AES_128_GCM_SHA256 (128/128 bits)
 key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256)
 (No client certificate requested)
 by simark.ca (Postfix) with ESMTPSA id E45041E552;
 Mon,  7 Dec 2020 11:06:39 -0500 (EST)
Subject: Re: [PATCH 4/4] gdb: fix value_subscript when array upper bound is
 not known
To: Joel Brobecker <brobecker@adacore.com>,
 Simon Marchi via Gdb-patches <gdb-patches@sourceware.org>
Cc: Simon Marchi <simon.marchi@efficios.com>
References: <20201123162120.3778679-1-simon.marchi@efficios.com>
 <20201123162120.3778679-5-simon.marchi@efficios.com>
 <20201206075437.GC327270@adacore.com>
From: Simon Marchi <simon.marchi@polymtl.ca>
Message-ID: <8a48f527-bd8f-5aa9-f232-7523359515fa@polymtl.ca>
Date: Mon, 7 Dec 2020 11:06:39 -0500
User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:78.0) Gecko/20100101
 Thunderbird/78.4.3
MIME-Version: 1.0
In-Reply-To: <20201206075437.GC327270@adacore.com>
Content-Type: text/plain; charset=utf-8; format=flowed
Content-Language: en-US
Content-Transfer-Encoding: 7bit
X-Poly-FromMTA: (simark.ca [158.69.221.121]) at Mon, 7 Dec 2020 16:06:41 +0000
X-Spam-Status: No, score=-3.6 required=5.0 tests=BAYES_00, DKIM_SIGNED,
 DKIM_VALID, DKIM_VALID_AU, DKIM_VALID_EF, NICE_REPLY_A, RCVD_IN_MSPIKE_H3,
 RCVD_IN_MSPIKE_WL, SPF_HELO_PASS, SPF_PASS,
 TXREP autolearn=ham autolearn_force=no version=3.4.2
X-Spam-Checker-Version: SpamAssassin 3.4.2 (2018-09-13) on
 server2.sourceware.org
X-BeenThere: gdb-patches@sourceware.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Gdb-patches mailing list <gdb-patches.sourceware.org>
List-Unsubscribe: <https://sourceware.org/mailman/options/gdb-patches>,
 <mailto:gdb-patches-request@sourceware.org?subject=unsubscribe>
List-Archive: <https://sourceware.org/pipermail/gdb-patches/>
List-Post: <mailto:gdb-patches@sourceware.org>
List-Help: <mailto:gdb-patches-request@sourceware.org?subject=help>
List-Subscribe: <https://sourceware.org/mailman/listinfo/gdb-patches>,
 <mailto:gdb-patches-request@sourceware.org?subject=subscribe>
X-List-Received-Date: Mon, 07 Dec 2020 16:06:50 -0000


On 2020-12-06 2:54 a.m., Joel Brobecker wrote:
> On Mon, Nov 23, 2020 at 11:21:20AM -0500, Simon Marchi via Gdb-patches wrote:
>> Since commit 7c6f27129631 ("gdb: make get_discrete_bounds check for
>> non-constant range bounds"), subscripting  flexible array member fails:
>>
>>      struct no_size
>>      {
>>        int n;
>>        int items[];
>>      };
>>
>>      (gdb) p *ns
>>      $1 = {n = 3, items = 0x5555555592a4}
>>      (gdb) p ns->items[0]
>>      Cannot access memory at address 0xfffe555b733a0164
>>      (gdb) p *((int *) 0x5555555592a4)
>>      $2 = 101  <--- we would expect that
>>      (gdb) p &ns->items[0]
>>      $3 = (int *) 0xfffe5559ee829a24  <--- wrong address
>>
>> Since the flexible array member (items) has an unspecified size, the array type
>> created for it in the DWARF doesn't have dimensions (this is with gcc 9.3.0,
>> Ubuntu 20.04):
>>
>>      0x000000a4:   DW_TAG_array_type
>>                      DW_AT_type [DW_FORM_ref4]       (0x00000038 "int")
>>                      DW_AT_sibling [DW_FORM_ref4]    (0x000000b3)
>>
>>      0x000000ad:     DW_TAG_subrange_type
>>                        DW_AT_type [DW_FORM_ref4]     (0x00000031 "long unsigned int")
>>
>> This causes GDB to create a range type (TYPE_CODE_RANGE) with a defined
>> constant low bound (dynamic_prop with kind PROP_CONST) and an undefined
>> high bound (dynamic_prop with kind PROP_UNDEFINED).
>>
>> value_subscript gets both bounds of that range using
>> get_discrete_bounds.  Before commit 7c6f27129631, get_discrete_bounds
>> didn't check the kind of the dynamic_props and would just blindly read
>> them as if they were PROP_CONST.  It would return 0 for the high bound,
>> because we zero-initialize the range_bounds structure.  And it didn't
>> really matter in this case, because the returned high bound wasn't used
>> in the end.
>>
>> Commit 7c6f27129631 changed get_discrete_bounds to return a failure if
>> either the low or high bound is not a constant, to make sure we don't
>> read a dynamic prop that isn't a PROP_CONST as a PROP_CONST.  This
>> change made get_discrete_bounds start to return a failure for that
>> range, and as a result would not set *lowp and *highp.  And since
>> value_subscript doesn't check get_discrete_bounds' return value, it just
>> carries on an uses an uninitialized value for the low bound.  If
>               ^^
> 
> "an" -> "and"
> 
>> value_subscript did check the return value of get_discrete_bounds, we
>> would get an error message instead of a bogus value.  But it would still
>> be a bug, as we wouldn't be able to print the flexible array member's
>> elements.
>>
>> Looking at value_subscript, we see that the low bound is always needed,
>> but the high bound is only needed if !c_style.  So, change
>> value_subscript to use get_discrete_low_bound and
>> get_discrete_high_bound separately.  This fixes the case described
>> above, where the low bound is known but the high bound isn't (and is not
>> needed).  This restores the original behavior without accessing a
>> dynamic_prop in a wrong way.
>>
>> A test is added.  In addition to the case described above, a case with
>> an array member of size 0 is added, which is a GNU C extension that
>> existed before flexible array members were introduced.  That case
>> currently fails when compiled with gcc <= 8.  gcc <= 8 produces DWARF
>> similar to the one shown above, while gcc 9 adds a DW_AT_count of 0 in
>> there, which makes the high bound known.  A case where an array member
>> of size 0 is the only member of the struct is also added, as that was
>> how PR 28675 was originally reported, and it's an interesting corner
>> case that I think could trigger other funny bugs.
>>
>> Question about the implementation: in value_subscript, I made it such
>> that if the low or high bound is unknown, we fall back to zero.  That
>> effectively makes it the same as it was before 7c6f27129631.  But should
>> we instead error() out?
> 
> Good question!
> 
> In thinking about it, I think there is a bigger question. But before
> asking the bigger question, why would we treat an unknown high bound
> differently from an unknown high bound?

Do you mean "unknown low bound differently from an unknown high bound"?

I don't really have a solid
> answer to that question. I almost want to say that we want to treat
> both the same, which argues in favor of the approach you took in
> this patch, and absent a good answer to that question, because that's
> what we were already doing in the past, it's another justification
> for the status quo.
> 
> Now, the bigger question is: Why are we even getting to this point?
> Shouldn't the bounds simply be resolved before we do the subscripting?

If the array is c-style, there's no concept of high bound, so I don't 
think there's a point resolving the high bound then.

> Thinking about this questions brings me back to the difficulties
> that AdaCore had with dealing with this kind of situation where
> resolving the array bounds can lead to values whose size if ginormous,
> and thus the (impossible) need to be careful to never actually fetch
> the value -- something I believe we side-stepped by replacing the value
> by a pointer to the array itself. It was tricky, but the question of
> whether we should only subscript arrays remains open, especially when
> the actual low bound after bound resolution ends up being nonzero,
> because it then affects which element of the array you display.

I don't have much experience with languages others than C/C++ that have 
smarter array ranges, so I don't have any meaningful opinion on this.

> Another smaller question is about what we should be doing if we detect
> a situation when the user tries to subscript an array at an index
> which is outside the array's range. I believe we should let the user
> try it, but should we warn, for instance?

Again, I've never really been a user of such a language.  Intuitively, 
I'd like the debugger to work as close to the source language as 
possible, so I'd like it to validate the bounds in my expression.  But 
it might be that people who use these languages sometimes need to 
side-step the language's protections.

> 
> While those are interesting discussions, given the impact of this
> regressions, and the fact that this patch is restoring the status quo,
> I'm in favor of discussing the above separately from this patch.

Ok, thanks.

> 
>> gdb/ChangeLog:
>>
>> 	PR 26875, PR 26901
>> 	* gdbtypes.c (get_discrete_low_bound): Make non-static.
>> 	(get_discrete_high_bound): Make non-static.
>> 	* gdbtypes.h (get_discrete_low_bound): New declaration.
>> 	(get_discrete_high_bound): New declaration.
>> 	* valarith.c (value_subscript): Only fetch high bound if
>> 	necessary.
>>
>> gdb/testsuite/ChangeLog:
>>
>> 	PR 26875, PR 26901
>> 	* gdb.base/flexible-array-member.c: New test.
>> 	* gdb.base/flexible-array-member.exp: New test.
>>
>> Change-Id: I832056f80e6c56f621f398b4780d55a3a1e299d7
> 
> Small reminder to remove this Change-Id.

As mentioned in my reply on patch 1, I'd prefer to keep those if it 
doesn't bother you too much, as they are useful for bookkeeping on my end.

> 
> Other than that, the change looks OK to me.
> 
> I believe, between the impact of the fix, the fact that it is
> a recent regression, and the relatively straightforward nature
> of your patch series, I believe it will be safe to backport to
> the gdb-10-branch.

Thanks for the review!

Simon