public inbox for gdb-prs@sourceware.org
help / color / mirror / Atom feed
* [Bug rust/28637] New: Rust characters will be encoded using DW_ATE_UTF
@ 2021-11-29 18:09 tromey at sourceware dot org
  2021-11-29 18:09 ` [Bug rust/28637] " tromey at sourceware dot org
                   ` (4 more replies)
  0 siblings, 5 replies; 6+ messages in thread
From: tromey at sourceware dot org @ 2021-11-29 18:09 UTC (permalink / raw)
  To: gdb-prs

https://sourceware.org/bugzilla/show_bug.cgi?id=28637

            Bug ID: 28637
           Summary: Rust characters will be encoded using DW_ATE_UTF
           Product: gdb
           Version: 11.1
            Status: NEW
          Severity: normal
          Priority: P2
         Component: rust
          Assignee: unassigned at sourceware dot org
          Reporter: tromey at sourceware dot org
  Target Milestone: ---

The rust compiler is going to start emitting
the char type using DW_ATE_UTF.
See https://github.com/rust-lang/rust/pull/89887
This PR tracks this for the 11.x branch so that
we can backport the patch.

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 6+ messages in thread

* [Bug rust/28637] Rust characters will be encoded using DW_ATE_UTF
  2021-11-29 18:09 [Bug rust/28637] New: Rust characters will be encoded using DW_ATE_UTF tromey at sourceware dot org
@ 2021-11-29 18:09 ` tromey at sourceware dot org
  2021-11-29 20:25 ` cvs-commit at gcc dot gnu.org
                   ` (3 subsequent siblings)
  4 siblings, 0 replies; 6+ messages in thread
From: tromey at sourceware dot org @ 2021-11-29 18:09 UTC (permalink / raw)
  To: gdb-prs

https://sourceware.org/bugzilla/show_bug.cgi?id=28637

Tom Tromey <tromey at sourceware dot org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
           Assignee|unassigned at sourceware dot org   |tromey at sourceware dot org

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 6+ messages in thread

* [Bug rust/28637] Rust characters will be encoded using DW_ATE_UTF
  2021-11-29 18:09 [Bug rust/28637] New: Rust characters will be encoded using DW_ATE_UTF tromey at sourceware dot org
  2021-11-29 18:09 ` [Bug rust/28637] " tromey at sourceware dot org
@ 2021-11-29 20:25 ` cvs-commit at gcc dot gnu.org
  2021-11-29 20:30 ` tromey at sourceware dot org
                   ` (2 subsequent siblings)
  4 siblings, 0 replies; 6+ messages in thread
From: cvs-commit at gcc dot gnu.org @ 2021-11-29 20:25 UTC (permalink / raw)
  To: gdb-prs

https://sourceware.org/bugzilla/show_bug.cgi?id=28637

--- Comment #1 from cvs-commit at gcc dot gnu.org <cvs-commit at gcc dot gnu.org> ---
The master branch has been updated by Tom Tromey <tromey@sourceware.org>:

https://sourceware.org/git/gitweb.cgi?p=binutils-gdb.git;h=1c0e43634cfdd0ad7ef9ac3dd7d208dddeb80f5e

commit 1c0e43634cfdd0ad7ef9ac3dd7d208dddeb80f5e
Author: Tom Tromey <tom@tromey.com>
Date:   Sun Oct 31 10:34:50 2021 -0600

    Allow DW_ATE_UTF for Rust characters

    The Rust compiler plans to change the encoding of a Rust 'char' type
    to use DW_ATE_UTF.  You can see the discussion here:

        https://github.com/rust-lang/rust/pull/89887

    However, this fails in gdb.  I looked into this, and it turns out that
    the handling of DW_ATE_UTF is currently fairly specific to C++.  In
    particular, the code here assumes the C++ type names, and it creates
    an integer type.

    This comes from commit 53e710acd ("GDB thinks char16_t and char32_t
    are signed in C++").  The message says:

        Both places need fixing.  But since I couldn't tell why dwarf2read.c
        needs to create a new type, I've made it use the per-arch built-in
        types instead, so that the types are only created once per arch
        instead of once per objfile.  That seems to work fine.

    ... which is fine, but it seems to me that it's also correct to make a
    new character type; and this approach is better because it preserves
    the type name as well.  This does use more memory, but first we
    shouldn't be too concerned about the memory use of types coming from
    debuginfo; and second, if we are, we should implement type interning
    anyway.

    Changing this code to use a character type revealed a couple of
    oddities in the C/C++ handling of TYPE_CODE_CHAR.  This patch fixes
    these as well.

    I filed PR rust/28637 for this issue, so that this patch can be
    backported to the gdb 11 branch.

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 6+ messages in thread

* [Bug rust/28637] Rust characters will be encoded using DW_ATE_UTF
  2021-11-29 18:09 [Bug rust/28637] New: Rust characters will be encoded using DW_ATE_UTF tromey at sourceware dot org
  2021-11-29 18:09 ` [Bug rust/28637] " tromey at sourceware dot org
  2021-11-29 20:25 ` cvs-commit at gcc dot gnu.org
@ 2021-11-29 20:30 ` tromey at sourceware dot org
  2021-11-29 20:45 ` cvs-commit at gcc dot gnu.org
  2021-11-29 20:46 ` tromey at sourceware dot org
  4 siblings, 0 replies; 6+ messages in thread
From: tromey at sourceware dot org @ 2021-11-29 20:30 UTC (permalink / raw)
  To: gdb-prs

https://sourceware.org/bugzilla/show_bug.cgi?id=28637

Tom Tromey <tromey at sourceware dot org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
   Target Milestone|---                         |11.2

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 6+ messages in thread

* [Bug rust/28637] Rust characters will be encoded using DW_ATE_UTF
  2021-11-29 18:09 [Bug rust/28637] New: Rust characters will be encoded using DW_ATE_UTF tromey at sourceware dot org
                   ` (2 preceding siblings ...)
  2021-11-29 20:30 ` tromey at sourceware dot org
@ 2021-11-29 20:45 ` cvs-commit at gcc dot gnu.org
  2021-11-29 20:46 ` tromey at sourceware dot org
  4 siblings, 0 replies; 6+ messages in thread
From: cvs-commit at gcc dot gnu.org @ 2021-11-29 20:45 UTC (permalink / raw)
  To: gdb-prs

https://sourceware.org/bugzilla/show_bug.cgi?id=28637

--- Comment #2 from cvs-commit at gcc dot gnu.org <cvs-commit at gcc dot gnu.org> ---
The gdb-11-branch branch has been updated by Tom Tromey
<tromey@sourceware.org>:

https://sourceware.org/git/gitweb.cgi?p=binutils-gdb.git;h=29b161c9be240da341910f0206ffdd881daacd96

commit 29b161c9be240da341910f0206ffdd881daacd96
Author: Tom Tromey <tom@tromey.com>
Date:   Sun Oct 31 10:34:50 2021 -0600

    Allow DW_ATE_UTF for Rust characters

    The Rust compiler plans to change the encoding of a Rust 'char' type
    to use DW_ATE_UTF.  You can see the discussion here:

        https://github.com/rust-lang/rust/pull/89887

    However, this fails in gdb.  I looked into this, and it turns out that
    the handling of DW_ATE_UTF is currently fairly specific to C++.  In
    particular, the code here assumes the C++ type names, and it creates
    an integer type.

    This comes from commit 53e710acd ("GDB thinks char16_t and char32_t
    are signed in C++").  The message says:

        Both places need fixing.  But since I couldn't tell why dwarf2read.c
        needs to create a new type, I've made it use the per-arch built-in
        types instead, so that the types are only created once per arch
        instead of once per objfile.  That seems to work fine.

    ... which is fine, but it seems to me that it's also correct to make a
    new character type; and this approach is better because it preserves
    the type name as well.  This does use more memory, but first we
    shouldn't be too concerned about the memory use of types coming from
    debuginfo; and second, if we are, we should implement type interning
    anyway.

    Changing this code to use a character type revealed a couple of
    oddities in the C/C++ handling of TYPE_CODE_CHAR.  This patch fixes
    these as well.

    I filed PR rust/28637 for this issue, so that this patch can be
    backported to the gdb 11 branch.

    (cherry picked from commit 1c0e43634cfdd0ad7ef9ac3dd7d208dddeb80f5e)

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 6+ messages in thread

* [Bug rust/28637] Rust characters will be encoded using DW_ATE_UTF
  2021-11-29 18:09 [Bug rust/28637] New: Rust characters will be encoded using DW_ATE_UTF tromey at sourceware dot org
                   ` (3 preceding siblings ...)
  2021-11-29 20:45 ` cvs-commit at gcc dot gnu.org
@ 2021-11-29 20:46 ` tromey at sourceware dot org
  4 siblings, 0 replies; 6+ messages in thread
From: tromey at sourceware dot org @ 2021-11-29 20:46 UTC (permalink / raw)
  To: gdb-prs

https://sourceware.org/bugzilla/show_bug.cgi?id=28637

Tom Tromey <tromey at sourceware dot org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
         Resolution|---                         |FIXED
             Status|NEW                         |RESOLVED

--- Comment #3 from Tom Tromey <tromey at sourceware dot org> ---
Fixed now.

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 6+ messages in thread

end of thread, other threads:[~2021-11-29 20:46 UTC | newest]

Thread overview: 6+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2021-11-29 18:09 [Bug rust/28637] New: Rust characters will be encoded using DW_ATE_UTF tromey at sourceware dot org
2021-11-29 18:09 ` [Bug rust/28637] " tromey at sourceware dot org
2021-11-29 20:25 ` cvs-commit at gcc dot gnu.org
2021-11-29 20:30 ` tromey at sourceware dot org
2021-11-29 20:45 ` cvs-commit at gcc dot gnu.org
2021-11-29 20:46 ` tromey at sourceware dot org

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).