On Thu, 3 Dec 2009, Andy Koppe wrote: > 2009/12/3 Linda Walsh: >> C.UTF_8 doesn't exist. ... >> You can't have "C" and "UTF-8", because C means no encoding (default). >> UTF-8 IS an encoding, so they are mutually exclusive. > > From http://www.opengroup.org/onlinepubs/009695399/basedefs/xbd_chap07.html, > ยง7.2: > > "The tables in Locale Definition describe the characteristics and > behavior of the POSIX locale for data consisting entirely of > characters from the portable character set and the control character > set. For other characters, the behavior is unspecified." > > This means that characters 0..127 have to be treated as ASCII, but > beyond that an implementation can do what it wants. And on Cygwin 1.7, > plain "C" actually does imply UTF-8, which happily is > backward-compatible with ASCII. That's an interpretation that so far hasn't been blessed by the standards people. Any discussion of this topic should mention that, as a caveat. ymmv -- Thomas E. Dickey http://invisible-island.net ftp://invisible-island.net