public inbox for gdb-patches@sourceware.org
 help / color / mirror / Atom feed
From: Andrew Burgess <aburgess@redhat.com>
To: Thiago Jung Bauermann <thiago.bauermann@linaro.org>,
	Simon Marchi <simark@simark.ca>
Cc: Thiago Jung Bauermann via Gdb-patches
	<gdb-patches@sourceware.org>,
	Simon Marchi <simon.marchi@efficios.com>
Subject: Re: [PATCH v3 5/8] gdbserver: Transmit target description ID in thread list and stop reply
Date: Fri, 03 Feb 2023 16:29:47 +0000	[thread overview]
Message-ID: <87v8kir9tw.fsf@redhat.com> (raw)
In-Reply-To: <875ycja2n6.fsf@linaro.org>

Thiago Jung Bauermann <thiago.bauermann@linaro.org> writes:

> Simon Marchi <simark@simark.ca> writes:
>
>> On 2/2/23 14:52, Thiago Jung Bauermann wrote:
>>> 
>>> Simon Marchi <simark@simark.ca> writes:
>>> 
>>>>> Having read some of the later patches, I have some additional thoughts
>>>>> here:
>>>>>
>>>>> I think we should make it explicit here that IDs are connection wide,
>>>>> not per-process.  We should also make it clear that GDB might[1] cache
>>>>> target descriptions per remote connection, and so a remote target should
>>>>> not reuse a target description ID except where the target description is
>>>>> identical.
>>> 
>>> Ah, good point. I will make that clarification.
>>> 
>>>>> [1] I say "GDB might" here because if we say "GDB will" then this would
>>>>> imply each target description will only be asked for once.  And I
>>>>> figure, why be overly restrictive.
>>>>
>>>> Thanks for pointing this out, I had the same thought while reading the
>>>> patch.
>>>>
>>>> In my original idea, I imagined that target description IDs could be
>>>> some hashes computed from the XML content (a bit like git hashes or ELF
>>>> build IDs), such that a given target description would always have the
>>>> same ID.  This would give clients the possibility to cache target
>>>> descriptions locally, a bit like the index cache.  It did sound nice,
>>>> but perhaps it's not really important.
>>> 
>>> Ah, sorry I misunderstood this part of your suggestion. I thought that
>>> the caching was supposed to be limited to the duration of the connection,
>>> and thus the m_tdescs map in struct remote state would be enough to
>>> provide that functionality. Do you mean that the cache should be on
>>> disk, so that it survives GDB quitting? I can look into that if you want
>>> and implement it, if not complicated.
>>
>> To be clear, I'm not asking you to implement an on-disk cache, I'm just
>> trying to think about what the limitations of the proposed solution are.
>> Because mistakes or shortcomings introduced in the remote protocol (like
>> any API / ABI meant to be stable) are difficult to fix afterwards.  If
>> there is a chance we want to do an on-disk cache (or share a cache
>> between multiple concurrent connections), we should think about that
>> now.
>
> Indeed, that's a good idea.
>
>> On one hand, perhaps we consider that target descriptions are small
>> enough that there's no point in having a persistent cache like that.
>> The time to fetch each target description once per connection is
>> probably insignificant.
>
> I don't have a good intuition about that. My uneducated guess is that
> it's not worth it, but as you say I think it's a good idea to leave the
> door open if we can.
>
>> On the other hand, perhaps doing the hash approach is easy enough that
>> we might as well do it, and that just leaves more doors open.  In my
>> mind, as a start, we could just pass the XML of the tdesc through a
>> sha256 function, possibly doing something more sophisticated later (this
>> means the hash of a given tdesc could change over time, but still two
>> identical hashes mean identical tdescs).  However, I don't think there
>> are sha256 functions built in on the OSes we want to support, so we'd
>> have to depend on some external library.  And that complicates things
>> quite a bit...  We already have a CRC32 function in gdbserver, but I
>> don't know if that's good enough to be confident there won't be
>> collisions.
>
> I tried to research this a bit, but I'm not confident enough in my
> knowledge of the subject to reach a conclusion. I only know enough about
> hashes to be dangerous. I did find a hash testsuite called SMhasher
> which seems to be well regarded, and its website¹ has
> reports for CRC32², which shows that it fails many collision and
> distribution bias tests. So it doesn't look like it is very reliable for
> using as a hash.
>
> We do support libxxhash as an optional dependency, so we could use that
> if gdbserver is linked with it. Its SMhasher report³ looks good, IUUC
> (but not the 32 bits version though⁴).
>
> We could also vendor coreutils's lib/sha256.[ch] files. They're 660
> lines in total.
>
>>> In this case, I think the tdesc ID should be of the format
>>> <prefix>:<hex number>, so that <prefix> can be “sha1”, “sha256” etc to
>>> indicate that <hex number> is a hash with the given algo, or even
>>> “number” to indicate that it's a simple integer like it is today.
>>> 
>>> Perhaps we can do the prefix thing even if not implementing the cache,
>>> to leave the possibility of adding it in the future?
>>
>> I think I would choose between hash and number, but not do both, I don't
>> see the need to have both.
>
> I suggested a prefix to designate an integer ID as a way to implement
> only this possibility for now, but which would allow implementing hash
> IDs later, as additional prefixes. Or another possibility is that if we
> later decide to support hashes, we could add a separate protocol feature
> to signal that.

So, I'd also be in favour of supporting the simple "just a number"
scheme.  Not everything that talks to GDB is our GDBServer, and not
every target might have the right hashing library.  I'd hate to force
folk to have to implement some hashing code, just to use this feature of
the remote protocol.

I have a proposal for how to negotiate different ID types without
adding a prefix to every ID sent from gdbserver...

We don't need to use a prefix in the actual ID.  I believe you're
already adding a qSupported feature for per-thread-tdesc.  The
qSupported mechanism already supports value passing.

So, we could pass a value back and forth in the qSupported negotiation
where GDB and GDBServer can agree on a numbering scheme.

If GDB sends out:

  per-thread-tdesc=value

Where `value` is a comma separated list of letters, each letter
representing a scheme that GDB understands:

  N: Each ID is a number,

  H: Each ID is a hash of the target description contents.

Then GDBServer replies with:

  per-thread-tdesc=value

Where `value` is a single letter drawn from the set of letters that GDB
sent out, this is the scheme that GDBServer will be using.  Thus,
GDBServer might reply with _one_ of the following:

  per-thread-tdesc=N
  per-thread-tdesc=H

If GDB only offers a scheme that GDBServer doesn't understand, e.g. GDB
sends:

  per-thread-tdesc=X

Then GDBServer just doesn't send this feature back, effectively claiming
it doesn't support 'per-thread-tdesc'.

The great thing about this is that the hard work here is all in writing
the documentation.

To begin with you _only_ need to support the 'N' scheme.  GDB always
sends out just 'per-thread-tdesc=N', GDBServer just checks that the
value is 'N', and replies 'per-thread-tdesc=N'.  GDB checks that the
return value is also 'N', and then we carry on as before.  BUT, we now
have a planned route to add more complex schemes in the future.

Thanks,
Andrew


  reply	other threads:[~2023-02-03 16:29 UTC|newest]

Thread overview: 94+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2023-01-30  4:45 [PATCH v3 0/8] gdbserver improvements for AArch64 SVE support Thiago Jung Bauermann
2023-01-30  4:45 ` [PATCH v3 1/8] gdbserver: Add assert in find_register_by_number Thiago Jung Bauermann
2023-01-31 17:05   ` Simon Marchi
2023-01-31 19:49     ` Thiago Jung Bauermann
2023-02-01 15:43       ` Simon Marchi
2023-01-30  4:45 ` [PATCH v3 2/8] gdbserver: Add PID parameter to linux_get_auxv and linux_get_hwcap Thiago Jung Bauermann
2023-02-01  9:07   ` Luis Machado
2023-02-01 10:54   ` Andrew Burgess
2023-02-01 16:01     ` Simon Marchi
2023-02-01 19:33       ` Thiago Jung Bauermann
2023-02-01 19:53         ` Simon Marchi
2023-02-01 21:55           ` Thiago Jung Bauermann
2023-02-06 19:54   ` Pedro Alves
2023-02-06 20:16     ` Simon Marchi
2023-02-07 15:19       ` Pedro Alves
2023-02-07 21:47         ` Thiago Jung Bauermann
2023-02-09  1:31           ` Simon Marchi
2023-02-10  3:54             ` Thiago Jung Bauermann
2023-02-07 22:28     ` Thiago Jung Bauermann
2023-01-30  4:45 ` [PATCH v3 3/8] gdbserver/linux-aarch64: Factor out function to get aarch64_features Thiago Jung Bauermann
2023-02-01  8:59   ` Luis Machado
2023-02-01 16:04     ` Simon Marchi
2023-02-01 22:13       ` Thiago Jung Bauermann
2023-01-30  4:45 ` [PATCH v3 4/8] gdbserver/linux-aarch64: When thread stops, update its target description Thiago Jung Bauermann
2023-02-01  9:05   ` Luis Machado
2023-02-01 11:06   ` Andrew Burgess
2023-02-01 16:21     ` Simon Marchi
2023-02-01 16:32       ` Luis Machado
2023-02-02  2:54         ` Thiago Jung Bauermann
2023-02-02  3:47           ` Simon Marchi
2023-02-03  3:47             ` Thiago Jung Bauermann
2023-02-03 11:13               ` Luis Machado
2023-02-04 15:26                 ` Thiago Jung Bauermann
2023-02-03 11:11             ` Luis Machado
2023-02-04 15:21               ` Thiago Jung Bauermann
2023-02-06  9:07                 ` Luis Machado
2023-02-06 12:15                   ` Thiago Jung Bauermann
2023-02-06 20:29                 ` Pedro Alves
2023-02-07  8:11                   ` Luis Machado
2023-02-07 14:39                     ` Thiago Jung Bauermann
2023-02-03 10:57           ` Luis Machado
2023-02-04  6:18             ` Thiago Jung Bauermann
2023-02-06 20:26           ` Pedro Alves
2023-02-07 21:06             ` Thiago Jung Bauermann
2023-02-09  2:46               ` Simon Marchi
2023-02-10  3:29                 ` Thiago Jung Bauermann
2023-02-10 14:56                   ` Luis Machado
2023-02-10 15:04                     ` Tom Tromey
2023-02-10 15:28                       ` Luis Machado
2023-02-10 17:26                       ` Thiago Jung Bauermann
2023-02-10 21:01                         ` Simon Marchi
2023-01-30  4:45 ` [PATCH v3 5/8] gdbserver: Transmit target description ID in thread list and stop reply Thiago Jung Bauermann
2023-01-30 12:52   ` Eli Zaretskii
2023-01-30 14:05     ` Thiago Jung Bauermann
2023-02-01  9:39   ` Luis Machado
2023-02-01 12:07   ` Andrew Burgess
2023-02-01 13:03     ` Eli Zaretskii
2023-02-01 17:37     ` Simon Marchi
2023-02-02 20:36       ` Thiago Jung Bauermann
2023-02-02 20:56         ` Simon Marchi
2023-02-01 20:46     ` Simon Marchi
2023-02-02 21:43       ` Thiago Jung Bauermann
2023-02-01 14:51   ` Andrew Burgess
2023-02-01 17:03     ` Simon Marchi
2023-02-02 19:52       ` Thiago Jung Bauermann
2023-02-02 20:51         ` Simon Marchi
2023-02-03  2:44           ` Thiago Jung Bauermann
2023-02-03 16:29             ` Andrew Burgess [this message]
2023-02-04  6:08               ` Thiago Jung Bauermann
2023-02-03 11:22       ` Luis Machado
2023-02-03 12:50         ` Simon Marchi
2023-01-30  4:45 ` [PATCH v3 6/8] gdb/remote: Parse tdesc field in stop reply and threads list XML Thiago Jung Bauermann
2023-02-01  9:52   ` Luis Machado
2023-02-05  0:06     ` Thiago Jung Bauermann
2023-02-06  9:10       ` Luis Machado
2023-02-01 14:32   ` Andrew Burgess
2023-02-01 19:50     ` Simon Marchi
2023-02-01 20:16       ` Simon Marchi
2023-02-03 11:27         ` Luis Machado
2023-02-03 13:19           ` Simon Marchi
2023-02-03 16:33             ` Andrew Burgess
2023-01-30  4:45 ` [PATCH v3 7/8] gdb/aarch64: Detect vector length changes when debugging remotely Thiago Jung Bauermann
2023-02-01  9:58   ` Luis Machado
2023-02-01 15:26   ` Andrew Burgess
2023-02-01 20:20     ` Simon Marchi
2023-02-03 11:31       ` Luis Machado
2023-02-03 16:38       ` Andrew Burgess
2023-02-03 19:07         ` Simon Marchi
2023-01-30  4:45 ` [PATCH v3 8/8] gdb/testsuite: Add test to exercise multi-threaded AArch64 SVE inferiors Thiago Jung Bauermann
2023-02-01 10:10   ` Luis Machado
2023-02-06 19:11 ` [PATCH v3 0/8] gdbserver improvements for AArch64 SVE support Pedro Alves
2023-02-06 20:05   ` Simon Marchi
2023-02-06 21:06     ` Pedro Alves
2023-02-07 13:49       ` Simon Marchi

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=87v8kir9tw.fsf@redhat.com \
    --to=aburgess@redhat.com \
    --cc=gdb-patches@sourceware.org \
    --cc=simark@simark.ca \
    --cc=simon.marchi@efficios.com \
    --cc=thiago.bauermann@linaro.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).