Date: Fri, 14 Apr 2023 22:20:14 +0200
From: Corinna Vinschen
To: Brian Inglis via Cygwin
Cc: Brian Inglis, Gionatan Danti
Subject: Re: Can not stat file with utf char U+F020
Reply-To: cygwin@cygwin.com
References: <992b3c28d7f1cfc17f7c9bb47b53f770@assyoma.it>

On Apr 14 13:54, Brian Inglis via Cygwin wrote:
> On 2023-04-14 13:00, Corinna Vinschen via Cygwin wrote:
> > On Apr 14 19:53, Gionatan Danti via Cygwin wrote:
> > > [1] https://sourceware.org/legacy-ml/cygwin/2009-11/msg00043.html
> > >
> > While this patch would have fixed your problem, a later followup patch
> > broke your usage of U+F020 (space replacement) and, FWIW, of U+F02E
> > (dot replacement) again:
> > https://cygwin.com/cgit/newlib-cygwin/commit/?id=8802178fddfd
> > This was done to accommodate filesystems implementing the idiotic
> > approach to support only DOS filenames, i.e. not allowing leading or
> > trailing spaces and not allowing trailing dots.  These are Netapp and
> > Novell Netware filesystems.
> > See the last paragraph of
> > https://cygwin.com/cygwin-ug-net/using-specialnames.html#pathnames-specialchars
> >
> > Any chance you can just rename the files?
>
> UCSUR Under-ConScript Unicode Registry and its predecessor ConScript Unicode
> Registry CSUR
>
> https://www.kreativekorp.com/ucsur/
>
> http://www.evertype.com/standards/csur/
>
> unofficially register Unicode PUA glyphs for academic, artificial,
> constructed, historical, invented, and minority language scripts, some of
> which have made it into Unicode e.g.
>
> Script          CSUR            Unicode
> PHAISTOS DISC   U+E6D0-U+E6FF   U+101D0-U+101FF
> SHAVIAN         U+E700-U+E72F   U+10450-U+1047F
> DESERET         U+E830-U+E88F   U+10400-U+1044F
>
> and maintain their own Unidata e.g.
>
> https://www.kreativekorp.com/ucsur/UNIDATA/Blocks.txt
>
> and some Unicode fonts have -CSUR addition files (like -Italic etc.) that
> support BMP and SMP PUA glyphs.
>
> For Cygwin purposes:
>
> F000−F7FF       unassigned      Reserved for hacks and corporate use
>
> so Cygwin's special Windows file name character mappings are clear:
>

For completeness' sake, starting with commit 8802178fddfd:

  F020    (space)
> F022    "
> F02A    *
  F02E    .
> F03A    :
> F03C    <
> F03E    >
> F03F    ?
> F07C    |

Corinna
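
The list above follows a simple pattern: each replacement code point is
0xF000 plus the ASCII code of the character it stands in for.  A minimal
sketch of that arithmetic in Python follows.  The helper names to_pua and
from_pua are hypothetical and are not Cygwin's code.  It only reproduces
the table; it does not model when Cygwin applies the conversion (for
example the leading/trailing rules for spaces and dots mentioned above).

```python
# Illustrative sketch only: hypothetical helpers, not Cygwin's implementation.
PUA_BASE = 0xF000

# Characters from the list above; space and dot are the two added
# by commit 8802178fddfd according to the mail.
SPECIAL_CHARS = ' "*.:<>?|'

# Forward table: ASCII character -> private use area replacement.
TO_PUA = {ch: chr(PUA_BASE + ord(ch)) for ch in SPECIAL_CHARS}
# Reverse table: private use area replacement -> ASCII character.
FROM_PUA = {pua: ch for ch, pua in TO_PUA.items()}


def to_pua(ch: str) -> str:
    """Return the PUA replacement for ch, or ch unchanged if not special."""
    return TO_PUA.get(ch, ch)


def from_pua(ch: str) -> str:
    """Return the ASCII character for a PUA replacement, or ch unchanged."""
    return FROM_PUA.get(ch, ch)


if __name__ == "__main__":
    # Print the table in the same "code point, character" layout as the mail.
    for ch in SPECIAL_CHARS:
        print(f"{ord(to_pua(ch)):04X}\t{ch!r}")
```

Under this sketch a name that already contains U+F020 or U+F02E is
indistinguishable from one whose space or dot was replaced, which is in
line with those two code points no longer being usable as ordinary file
name characters since that commit.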