public inbox for cygwin@cygwin.com
 help / color / mirror / Atom feed
From: Tony Cook <tony@develop-help.com>
To: cygwin@cygwin.com
Subject: cygwin 2.4.1: broken ps_AF and ps_AF.utf8 locales
Date: Tue, 02 Feb 2016 04:33:00 -0000	[thread overview]
Message-ID: <20160202043247.GP31193@mars.tony.develop-help.com> (raw)

Hi list,

Simplified to a C program below, calls to sprintf() under the ps_AF
and ps_AF.utf8 locales are returning a value that doesn't match the
length of the formatted string:

tony@phobos ~
$ cat ps_AF.c
#include <stdio.h>
#include <locale.h>
#include <string.h>

int main(int argc, char **argv) {
  char buf[100];
  char *loc = argc > 1 ? argv[1] : "ps_AF";
  const char *real_loc;
  if (!(real_loc = setlocale(LC_NUMERIC, loc))) {
    perror("setlocale");
    return 1;
  }
  printf("locale %s\n", real_loc);
  size_t len = sprintf(buf, "%g", 2.34);
  printf("len %zu\n", len);
  printf("strlen %zu\n", strlen(buf));

  return 0;
}

tony@phobos ~
$ gcc -ops_AF.exe ps_AF.c

tony@phobos ~
$ ./ps_AF
locale ps_AF
len 4
strlen 5

tony@phobos ~
$ ./ps_AF ps_AF.utf8
locale ps_AF.utf8
len 4
strlen 5

tony@phobos ~
$ ./ps_AF en_US.utf8
locale en_US.utf8
len 4
strlen 4

tony@phobos ~
$ uname -a
CYGWIN_NT-6.1-WOW phobos 2.4.1(0.293/5/3) 2016-01-24 11:24 i686 Cygwin

The man pages and C standard could be read as sprintf() returning the
number of multi-byte characters, but if cygwin is intended to follow
Linux behaviour:

tony@mars:~/play$ gcc -ops_AF ps_AF.c 
tony@mars:~/play$ ./ps_AF
locale ps_AF
len 5
strlen 5
tony@mars:~/play$ ./ps_AF ps_AF.utf8
locale ps_AF.utf8
len 5
strlen 5
tony@mars:~/play$ ./ps_AF en_AU.utf8
locale en_AU.utf8
len 4
strlen 4
tony@mars:~/play$ uname -a
Linux mars 3.16.0-4-amd64 #1 SMP Debian 3.16.7-ckt20-1+deb8u3 (2016-01-17) x86_64 GNU/Linux

(and the decimal point under ps_AF on Linux is multi-byte, character
0x66b or ARABIC DECIMAL SEPARATOR.)

POSIX is less confusing and specifies:

Upon successful completion, the sprintf() function shall return the
number of bytes written to s, excluding the terminating null byte.

(http://pubs.opengroup.org/onlinepubs/9699919799/functions/fprintf.html)

Tony


--
Problem reports:       http://cygwin.com/problems.html
FAQ:                   http://cygwin.com/faq/
Documentation:         http://cygwin.com/docs.html
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple

             reply	other threads:[~2016-02-02  4:33 UTC|newest]

Thread overview: 4+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2016-02-02  4:33 Tony Cook [this message]
2016-02-02  8:42 ` Achim Gratz
2016-02-08 12:37 ` Corinna Vinschen
2016-02-02 22:22 Tony Cook

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20160202043247.GP31193@mars.tony.develop-help.com \
    --to=tony@develop-help.com \
    --cc=cygwin@cygwin.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).