public inbox for libc-alpha@sourceware.org
 help / color / mirror / Atom feed
From: наб <nabijaczleweli@nabijaczleweli.xyz>
To: Bruno Haible <bruno@clisp.org>
Cc: libc-alpha@sourceware.org
Subject: Re: [PATCH v16] POSIX locale covers every byte [BZ# 29511]
Date: Wed, 12 Jul 2023 22:22:10 +0200	[thread overview]
Message-ID: <6row5r5lyev35lddwxlukyopx5j5faxaqlw2audnolca55zrhc@fvrom2u2wmaj> (raw)
In-Reply-To: <4881032.NnENhoQgcM@nimes>

[-- Attachment #1: Type: text/plain, Size: 1569 bytes --]

Hi!

On Wed, Jul 12, 2023 at 09:44:26PM +0200, Bruno Haible wrote:
> Regarding the mapping of the bytes 0x80..0xFF:
> > By strategically picking c=<UDC00> we land at the same point of the
> > Unicode Low Surrogate Area at DC00-DCFF, described as
> >   > Isolated surrogate code points have no interpretation;
> >   > consequently, no character code charts or names lists
> >   > are provided for this range.
> > as the Python UTF-8 errors=surrogateescape encoding.
> musl libc maps the bytes 0x80..0xFF to U+DF80..U+DFFF. [1][2]
> 
> I think it is more useful to avoid an inconsistency between glibc and
> musl libc, than to be consistent with what a particular user-space
> program (Python) does.
> 
> How about mapping the bytes 0x80..0xFF to U+DF80..U+DFFF, like musl libc
> does?
That's what I had done originally (and citing the same exact reasons!),
but changed it in v10
  https://sourceware.org/pipermail/libc-alpha/2023-April/147652.html
because Florian likes it better. He forwarded it to musl@
  https://www.openwall.com/lists/musl/2022/11/10/1
in v6
  https://sourceware.org/pipermail/libc-alpha/2022-December/143690.html

In short: python uses the DCxx range, musl put it at DFxx for no
particular reason but decided to not move it because that would imply
some sort of stability or semantic meaning.

I personally like DFxx more but don't really care,
so Reviewer's Privilege of Mostly-Arbitrary Design Choice.

I can change it back for the same reason, but I'd rather do it, uh.
Once. Don't wanna be ping-ponging this.

[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 833 bytes --]

  reply	other threads:[~2023-07-12 20:22 UTC|newest]

Thread overview: 6+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2023-07-12 19:44 Bruno Haible
2023-07-12 20:22 ` наб [this message]
2023-07-12 21:08   ` Bruno Haible
  -- strict thread matches above, loose matches on Subject: below --
2023-07-12 21:49 Bruno Haible
2023-07-12 23:51 ` наб
2023-07-03 15:39 наб

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=6row5r5lyev35lddwxlukyopx5j5faxaqlw2audnolca55zrhc@fvrom2u2wmaj \
    --to=nabijaczleweli@nabijaczleweli.xyz \
    --cc=bruno@clisp.org \
    --cc=libc-alpha@sourceware.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).