public inbox for gdb-prs@sourceware.org
help / color / mirror / Atom feed
* [Bug rust/28637] New: Rust characters will be encoded using DW_ATE_UTF
@ 2021-11-29 18:09 tromey at sourceware dot org
2021-11-29 18:09 ` [Bug rust/28637] " tromey at sourceware dot org
` (4 more replies)
0 siblings, 5 replies; 6+ messages in thread
From: tromey at sourceware dot org @ 2021-11-29 18:09 UTC (permalink / raw)
To: gdb-prs
https://sourceware.org/bugzilla/show_bug.cgi?id=28637
Bug ID: 28637
Summary: Rust characters will be encoded using DW_ATE_UTF
Product: gdb
Version: 11.1
Status: NEW
Severity: normal
Priority: P2
Component: rust
Assignee: unassigned at sourceware dot org
Reporter: tromey at sourceware dot org
Target Milestone: ---
The rust compiler is going to start emitting
the char type using DW_ATE_UTF.
See https://github.com/rust-lang/rust/pull/89887
This PR tracks this for the 11.x branch so that
we can backport the patch.
--
You are receiving this mail because:
You are on the CC list for the bug.
^ permalink raw reply [flat|nested] 6+ messages in thread
* [Bug rust/28637] Rust characters will be encoded using DW_ATE_UTF
2021-11-29 18:09 [Bug rust/28637] New: Rust characters will be encoded using DW_ATE_UTF tromey at sourceware dot org
@ 2021-11-29 18:09 ` tromey at sourceware dot org
2021-11-29 20:25 ` cvs-commit at gcc dot gnu.org
` (3 subsequent siblings)
4 siblings, 0 replies; 6+ messages in thread
From: tromey at sourceware dot org @ 2021-11-29 18:09 UTC (permalink / raw)
To: gdb-prs
https://sourceware.org/bugzilla/show_bug.cgi?id=28637
Tom Tromey <tromey at sourceware dot org> changed:
What |Removed |Added
----------------------------------------------------------------------------
Assignee|unassigned at sourceware dot org |tromey at sourceware dot org
--
You are receiving this mail because:
You are on the CC list for the bug.
^ permalink raw reply [flat|nested] 6+ messages in thread
* [Bug rust/28637] Rust characters will be encoded using DW_ATE_UTF
2021-11-29 18:09 [Bug rust/28637] New: Rust characters will be encoded using DW_ATE_UTF tromey at sourceware dot org
2021-11-29 18:09 ` [Bug rust/28637] " tromey at sourceware dot org
@ 2021-11-29 20:25 ` cvs-commit at gcc dot gnu.org
2021-11-29 20:30 ` tromey at sourceware dot org
` (2 subsequent siblings)
4 siblings, 0 replies; 6+ messages in thread
From: cvs-commit at gcc dot gnu.org @ 2021-11-29 20:25 UTC (permalink / raw)
To: gdb-prs
https://sourceware.org/bugzilla/show_bug.cgi?id=28637
--- Comment #1 from cvs-commit at gcc dot gnu.org <cvs-commit at gcc dot gnu.org> ---
The master branch has been updated by Tom Tromey <tromey@sourceware.org>:
https://sourceware.org/git/gitweb.cgi?p=binutils-gdb.git;h=1c0e43634cfdd0ad7ef9ac3dd7d208dddeb80f5e
commit 1c0e43634cfdd0ad7ef9ac3dd7d208dddeb80f5e
Author: Tom Tromey <tom@tromey.com>
Date: Sun Oct 31 10:34:50 2021 -0600
Allow DW_ATE_UTF for Rust characters
The Rust compiler plans to change the encoding of a Rust 'char' type
to use DW_ATE_UTF. You can see the discussion here:
https://github.com/rust-lang/rust/pull/89887
However, this fails in gdb. I looked into this, and it turns out that
the handling of DW_ATE_UTF is currently fairly specific to C++. In
particular, the code here assumes the C++ type names, and it creates
an integer type.
This comes from commit 53e710acd ("GDB thinks char16_t and char32_t
are signed in C++"). The message says:
Both places need fixing. But since I couldn't tell why dwarf2read.c
needs to create a new type, I've made it use the per-arch built-in
types instead, so that the types are only created once per arch
instead of once per objfile. That seems to work fine.
... which is fine, but it seems to me that it's also correct to make a
new character type; and this approach is better because it preserves
the type name as well. This does use more memory, but first we
shouldn't be too concerned about the memory use of types coming from
debuginfo; and second, if we are, we should implement type interning
anyway.
Changing this code to use a character type revealed a couple of
oddities in the C/C++ handling of TYPE_CODE_CHAR. This patch fixes
these as well.
I filed PR rust/28637 for this issue, so that this patch can be
backported to the gdb 11 branch.
--
You are receiving this mail because:
You are on the CC list for the bug.
^ permalink raw reply [flat|nested] 6+ messages in thread
* [Bug rust/28637] Rust characters will be encoded using DW_ATE_UTF
2021-11-29 18:09 [Bug rust/28637] New: Rust characters will be encoded using DW_ATE_UTF tromey at sourceware dot org
2021-11-29 18:09 ` [Bug rust/28637] " tromey at sourceware dot org
2021-11-29 20:25 ` cvs-commit at gcc dot gnu.org
@ 2021-11-29 20:30 ` tromey at sourceware dot org
2021-11-29 20:45 ` cvs-commit at gcc dot gnu.org
2021-11-29 20:46 ` tromey at sourceware dot org
4 siblings, 0 replies; 6+ messages in thread
From: tromey at sourceware dot org @ 2021-11-29 20:30 UTC (permalink / raw)
To: gdb-prs
https://sourceware.org/bugzilla/show_bug.cgi?id=28637
Tom Tromey <tromey at sourceware dot org> changed:
What |Removed |Added
----------------------------------------------------------------------------
Target Milestone|--- |11.2
--
You are receiving this mail because:
You are on the CC list for the bug.
^ permalink raw reply [flat|nested] 6+ messages in thread
* [Bug rust/28637] Rust characters will be encoded using DW_ATE_UTF
2021-11-29 18:09 [Bug rust/28637] New: Rust characters will be encoded using DW_ATE_UTF tromey at sourceware dot org
` (2 preceding siblings ...)
2021-11-29 20:30 ` tromey at sourceware dot org
@ 2021-11-29 20:45 ` cvs-commit at gcc dot gnu.org
2021-11-29 20:46 ` tromey at sourceware dot org
4 siblings, 0 replies; 6+ messages in thread
From: cvs-commit at gcc dot gnu.org @ 2021-11-29 20:45 UTC (permalink / raw)
To: gdb-prs
https://sourceware.org/bugzilla/show_bug.cgi?id=28637
--- Comment #2 from cvs-commit at gcc dot gnu.org <cvs-commit at gcc dot gnu.org> ---
The gdb-11-branch branch has been updated by Tom Tromey
<tromey@sourceware.org>:
https://sourceware.org/git/gitweb.cgi?p=binutils-gdb.git;h=29b161c9be240da341910f0206ffdd881daacd96
commit 29b161c9be240da341910f0206ffdd881daacd96
Author: Tom Tromey <tom@tromey.com>
Date: Sun Oct 31 10:34:50 2021 -0600
Allow DW_ATE_UTF for Rust characters
The Rust compiler plans to change the encoding of a Rust 'char' type
to use DW_ATE_UTF. You can see the discussion here:
https://github.com/rust-lang/rust/pull/89887
However, this fails in gdb. I looked into this, and it turns out that
the handling of DW_ATE_UTF is currently fairly specific to C++. In
particular, the code here assumes the C++ type names, and it creates
an integer type.
This comes from commit 53e710acd ("GDB thinks char16_t and char32_t
are signed in C++"). The message says:
Both places need fixing. But since I couldn't tell why dwarf2read.c
needs to create a new type, I've made it use the per-arch built-in
types instead, so that the types are only created once per arch
instead of once per objfile. That seems to work fine.
... which is fine, but it seems to me that it's also correct to make a
new character type; and this approach is better because it preserves
the type name as well. This does use more memory, but first we
shouldn't be too concerned about the memory use of types coming from
debuginfo; and second, if we are, we should implement type interning
anyway.
Changing this code to use a character type revealed a couple of
oddities in the C/C++ handling of TYPE_CODE_CHAR. This patch fixes
these as well.
I filed PR rust/28637 for this issue, so that this patch can be
backported to the gdb 11 branch.
(cherry picked from commit 1c0e43634cfdd0ad7ef9ac3dd7d208dddeb80f5e)
--
You are receiving this mail because:
You are on the CC list for the bug.
^ permalink raw reply [flat|nested] 6+ messages in thread
* [Bug rust/28637] Rust characters will be encoded using DW_ATE_UTF
2021-11-29 18:09 [Bug rust/28637] New: Rust characters will be encoded using DW_ATE_UTF tromey at sourceware dot org
` (3 preceding siblings ...)
2021-11-29 20:45 ` cvs-commit at gcc dot gnu.org
@ 2021-11-29 20:46 ` tromey at sourceware dot org
4 siblings, 0 replies; 6+ messages in thread
From: tromey at sourceware dot org @ 2021-11-29 20:46 UTC (permalink / raw)
To: gdb-prs
https://sourceware.org/bugzilla/show_bug.cgi?id=28637
Tom Tromey <tromey at sourceware dot org> changed:
What |Removed |Added
----------------------------------------------------------------------------
Resolution|--- |FIXED
Status|NEW |RESOLVED
--- Comment #3 from Tom Tromey <tromey at sourceware dot org> ---
Fixed now.
--
You are receiving this mail because:
You are on the CC list for the bug.
^ permalink raw reply [flat|nested] 6+ messages in thread
end of thread, other threads:[~2021-11-29 20:46 UTC | newest]
Thread overview: 6+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2021-11-29 18:09 [Bug rust/28637] New: Rust characters will be encoded using DW_ATE_UTF tromey at sourceware dot org
2021-11-29 18:09 ` [Bug rust/28637] " tromey at sourceware dot org
2021-11-29 20:25 ` cvs-commit at gcc dot gnu.org
2021-11-29 20:30 ` tromey at sourceware dot org
2021-11-29 20:45 ` cvs-commit at gcc dot gnu.org
2021-11-29 20:46 ` tromey at sourceware dot org
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).