From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: by sourceware.org (Postfix, from userid 2155) id 8153D3858D20; Fri, 14 Apr 2023 19:00:27 +0000 (GMT) DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org 8153D3858D20 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=cygwin.com; s=default; t=1681498827; bh=47yOtpEoJ5x3yQ6uEwdd7PbBSSNvPC2z1ZE9UgG8Tlc=; h=Date:From:To:Cc:Subject:Reply-To:References:In-Reply-To:From; b=TGUv9FM3JAcySqcAjYHS04LcdLia7sPLVyabo0FVPCay8AkkBdELkTUExm5aICwP8 zHNJ316hEbLnwlpmrl+ye2AzHWRQPzwdRHDaaZdnACJx8rPDn6aXtefphZZv0nm4yh X+6sSfSu1tBSdofkh+bRgtrEh7VieW4U3L5IVbtE= Received: by calimero.vinschen.de (Postfix, from userid 500) id 85751A81474; Fri, 14 Apr 2023 21:00:25 +0200 (CEST) Date: Fri, 14 Apr 2023 21:00:25 +0200 From: Corinna Vinschen To: Gionatan Danti Cc: cygwin@cygwin.com Subject: Re: Can not stat file with utf char U+F020 Message-ID: Reply-To: cygwin@cygwin.com Mail-Followup-To: Gionatan Danti , cygwin@cygwin.com References: <992b3c28d7f1cfc17f7c9bb47b53f770@assyoma.it> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline In-Reply-To: <992b3c28d7f1cfc17f7c9bb47b53f770@assyoma.it> List-Id: On Apr 14 19:53, Gionatan Danti via Cygwin wrote: > Dear list, > I have an issue with unreadable files with contain utf char U+F020 (which > appear as "middle dot with some space after") in their name. > > stat on such a file results in "no such file or directory" > > From here [1] it seems that a patch was contemplated many years ago, but I > don't know its status now. > > Any ideas or workaround? There's no (good) solution from inside Cygwin. Keep in mind that the Unicode area from U+E000 up to U+F8FF is called "Private Use Area". So none of the chars are mapped into any singlebyte, doublebyte, or multibyte charset. Typically we don't expect that filenames contain any of these chars, and we're only using a very small subset of them for our own, dubious purposes anyway: https://cygwin.com/cygwin-ug-net/using-specialnames.html#pathnames-specialchars > [1] https://sourceware.org/legacy-ml/cygwin/2009-11/msg00043.html While this patch would have fixed your problem, a later followup patch broke your usage of U+F020 (space replacement) and, FWIW, of U+F02E (dot replacement) again: https://cygwin.com/cgit/newlib-cygwin/commit/?id=8802178fddfd This was done to accomodate filesystems implementing the idiotic approach to support only DOS filenames, i. e., not allowing leading or trailing spaces and not allowing trailing dots. These are Netapp and Novell Netware filesystems. See the last paragraph of https://cygwin.com/cygwin-ug-net/using-specialnames.html#pathnames-specialchars Any chance you can just rename the files? Corinna