From: Tony Cook <tony@develop-help.com>
To: cygwin@cygwin.com
Subject: cygwin 2.4.1: broken ps_AF and ps_AF.utf8 locales
Date: Tue, 02 Feb 2016 04:33:00 -0000 [thread overview]
Message-ID: <20160202043247.GP31193@mars.tony.develop-help.com> (raw)
Hi list,
Simplified to a C program below, calls to sprintf() under the ps_AF
and ps_AF.utf8 locales are returning a value that doesn't match the
length of the formatted string:
tony@phobos ~
$ cat ps_AF.c
#include <stdio.h>
#include <locale.h>
#include <string.h>
int main(int argc, char **argv) {
char buf[100];
char *loc = argc > 1 ? argv[1] : "ps_AF";
const char *real_loc;
if (!(real_loc = setlocale(LC_NUMERIC, loc))) {
perror("setlocale");
return 1;
}
printf("locale %s\n", real_loc);
size_t len = sprintf(buf, "%g", 2.34);
printf("len %zu\n", len);
printf("strlen %zu\n", strlen(buf));
return 0;
}
tony@phobos ~
$ gcc -ops_AF.exe ps_AF.c
tony@phobos ~
$ ./ps_AF
locale ps_AF
len 4
strlen 5
tony@phobos ~
$ ./ps_AF ps_AF.utf8
locale ps_AF.utf8
len 4
strlen 5
tony@phobos ~
$ ./ps_AF en_US.utf8
locale en_US.utf8
len 4
strlen 4
tony@phobos ~
$ uname -a
CYGWIN_NT-6.1-WOW phobos 2.4.1(0.293/5/3) 2016-01-24 11:24 i686 Cygwin
The man pages and C standard could be read as sprintf() returning the
number of multi-byte characters, but if cygwin is intended to follow
Linux behaviour:
tony@mars:~/play$ gcc -ops_AF ps_AF.c
tony@mars:~/play$ ./ps_AF
locale ps_AF
len 5
strlen 5
tony@mars:~/play$ ./ps_AF ps_AF.utf8
locale ps_AF.utf8
len 5
strlen 5
tony@mars:~/play$ ./ps_AF en_AU.utf8
locale en_AU.utf8
len 4
strlen 4
tony@mars:~/play$ uname -a
Linux mars 3.16.0-4-amd64 #1 SMP Debian 3.16.7-ckt20-1+deb8u3 (2016-01-17) x86_64 GNU/Linux
(and the decimal point under ps_AF on Linux is multi-byte, character
0x66b or ARABIC DECIMAL SEPARATOR.)
POSIX is less confusing and specifies:
Upon successful completion, the sprintf() function shall return the
number of bytes written to s, excluding the terminating null byte.
(http://pubs.opengroup.org/onlinepubs/9699919799/functions/fprintf.html)
Tony
--
Problem reports: http://cygwin.com/problems.html
FAQ: http://cygwin.com/faq/
Documentation: http://cygwin.com/docs.html
Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple
next reply other threads:[~2016-02-02 4:33 UTC|newest]
Thread overview: 4+ messages / expand[flat|nested] mbox.gz Atom feed top
2016-02-02 4:33 Tony Cook [this message]
2016-02-02 8:42 ` Achim Gratz
2016-02-08 12:37 ` Corinna Vinschen
2016-02-02 22:22 Tony Cook
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20160202043247.GP31193@mars.tony.develop-help.com \
--to=tony@develop-help.com \
--cc=cygwin@cygwin.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).