public inbox for cygwin@cygwin.com
 help / color / mirror / Atom feed
From: Corinna Vinschen <corinna-cygwin@cygwin.com>
To: Corinna Vinschen via Cygwin <cygwin@cygwin.com>
Cc: Brian Inglis <Brian.Inglis@shaw.ca>, Bruno Haible <bruno@clisp.org>
Subject: Re: character class "alpha"
Date: Wed, 2 Aug 2023 17:06:25 +0200	[thread overview]
Message-ID: <ZMpw8VsYzbTsKb2t@calimero.vinschen.de> (raw)
In-Reply-To: <ZMoMKj9DHrf1LDnc@calimero.vinschen.de>

On Aug  2 09:56, Corinna Vinschen via Cygwin wrote:
> On Aug  1 10:29, Brian Inglis via Cygwin wrote:
> > On 2023-07-31 15:12, Corinna Vinschen via Cygwin wrote:
> > > Hi Bruno,
> > > 
> > > On Jul 31 20:43, Bruno Haible via Cygwin wrote:
> > > > Corinna Vinschen wrote:
> > > > > there are more of those expressions which are disabled on glibc and
> > > > > fail on Cygwin, for instance in test-c32iscntrl.c.  Maybe it's actually
> > > > > the better idea to disable them on Cygwin, too, rather than to change
> > > > > a working system...
> > > > 
> > > > Sure. There is no standard how to map the Unicode properties to POSIX
> > > > character classes. Other than the mentioned ISO C constraints for
> > > > 'digit' and 'xdigit' and a few POSIX constraints, you are free to
> > > > map them as you like. For glibc and gnulib, I mapped them in a way
> > > > that seemed to make most sense for applications. But different
> > > > people might come to different meanings of "make sense".
> > > 
> > > Ok, so I just pushed a patchset to Cygwin git, which should make GB18030
> > > support actually work.
> > > 
> > > Also, the C11 functions c16rtomb, c32rtomb, mbrtoc16, mbrtoc32 are now
> > > implemented in Cygwin and a uchar.h header exists now, too.
> > > 
> > > Assuming all gnulib tests disabled for GLibc in
> > > 
> > >    test-c32isalpha.c
> > >    test-c32iscntrl.c
> > >    test-c32isprint.c
> > >    test-c32isgraph.c
> > >    test-c32ispunct.c
> > >    test-c32islower.c
> > > 
> > > will be disabled for Cygwin as well, all gb18030 and c32 tests in gnulib
> > > work as desired now.
> > 
> > 	https://www.iso.org/standard/86539.html		[ISO/IEC/IEEE 9945 CD]
> > 
> > Draft POSIX 2023 SUS V5 Issue 8 D3 CB2.1 proposes the following POSIX
> > Subprofiling Option Group: POSIX_C_LANG_UCHAR: ISO C Unicode Utilities.
> > 
> > 	https://www.iso.org/standard/82075.html		[ISO/IEC 9899 DIS]
> > 
> > Draft Standard C 2023 is being voted on as of 2023-07-14, and if no
> > technical issues arise requiring tweaks, will become the new standard, in
> > which Unicode utilities <uchar.h> has some additions which you may wish to
> > add; from:
> 
> Maybe at one point, but nobody keeps you from sending patches :)

Never mind, had a bit of time.

I fixed the uchar.h header and implemented c8rtomb und mbrtoc8.
Still needs testing.  Does anybody know of an easily accessible
testsuite testing these functions?

However, I did not define __STDC_VERSION_UCHAR_H__ yet.  I wasn't sure
my uchar.h is compliant, and Glibc doesn't define that macro yet,
either.


Corinna

  reply	other threads:[~2023-08-02 15:06 UTC|newest]

Thread overview: 32+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2023-07-27 10:15 fnmatch improvements Bruno Haible
2023-07-27 18:24 ` Corinna Vinschen
2023-07-27 19:05   ` Corinna Vinschen
2023-07-27 20:25     ` Brian Inglis
2023-07-27 21:22       ` Bruno Haible
2023-07-27 22:17         ` Brian Inglis
2023-07-28  9:00           ` Corinna Vinschen
2023-07-28  9:53             ` Corinna Vinschen
2023-07-27 21:40     ` Bruno Haible
2023-07-28  8:53       ` Corinna Vinschen
2023-07-28 10:56         ` Bruno Haible
2023-07-28 11:14           ` Corinna Vinschen
2023-07-28 18:59           ` Corinna Vinschen
2023-07-28 19:33             ` Bruno Haible
2023-07-28 19:54             ` GB18030 locale Bruno Haible
2023-07-29  9:23               ` Corinna Vinschen
2023-07-29  9:53                 ` Bruno Haible
2023-07-31 10:07                   ` Corinna Vinschen
2023-07-31 13:38                     ` Corinna Vinschen
2023-07-31 14:06                       ` character class "alpha" Bruno Haible
2023-07-31 17:46                         ` Corinna Vinschen
2023-07-31 18:20                           ` Corinna Vinschen
2023-07-31 18:43                             ` Bruno Haible
2023-07-31 21:12                               ` Corinna Vinschen
2023-08-01 16:29                                 ` Brian Inglis
2023-08-02  7:56                                   ` Corinna Vinschen
2023-08-02 15:06                                     ` Corinna Vinschen [this message]
2023-07-31 21:13                               ` Brian Inglis
2023-07-31 21:37                                 ` Bruno Haible
2023-07-28 11:12         ` fnmatch improvements Corinna Vinschen
2023-07-28 11:22           ` Bruno Haible
2023-07-28 21:42           ` Bill Stewart

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=ZMpw8VsYzbTsKb2t@calimero.vinschen.de \
    --to=corinna-cygwin@cygwin.com \
    --cc=Brian.Inglis@shaw.ca \
    --cc=bruno@clisp.org \
    --cc=cygwin@cygwin.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).