public inbox for cygwin@cygwin.com
 help / color / mirror / Atom feed
* utf/codepage-conversion on cmd-window
       [not found] <20151006210122.160D1A28.helmut.karlowski@ish.de>
@ 2015-10-06 19:01 ` Helmut Karlowski
  0 siblings, 0 replies; only message in thread
From: Helmut Karlowski @ 2015-10-06 19:01 UTC (permalink / raw)
  To: cygwin; +Cc: cygwin

[-- Attachment #1: Type: text/plain, Size: 1262 bytes --]

I came across a surprising behaviour when a cygwin-process prints to a
windows-console. In cygwin 1.7 (XP) I don't think any conversion took 
place, while now in cygwin 2 (10) cygwin-utf-8 gets converted into the
suitable(?) windows-encoding used in the cmd-window.

I have a file containing german umlaut-characters encoded in utf-8 and
cp850.

When I'm in the cmd-window:

C:\bat>type cmduml.txt
├û├ä├£├Â├ñ├╝├ƒ      -> some cp850-characters with 
high-bit set
ÖÄÜöäüß              -> correct output

C:\bat>\cygwin\bin\cat cmduml.txt
ÖÄÜöäüß               -> utf converted to cp850
¦¦¦¦¦¦¦                            -> ???

C:\bat>chcp
Aktive Codepage: 850.

It makes things easier the way it is now, but I could not find it 
documented. Also I wonder about cygwin's output of the 
cp-850-characters, I'd expect them to be printed unchanged, instead I 
only see grey rectangles.

In the cygwin-window the file is:

/c/bat|17:05:30#od -x cmduml.txt
0000000 96c3 84c3 9cc3 b6c3 a4c3 bcc3 9fc3 990a
0000020 9a8e 8494 e181 0a0d
0000030

I've also attached it.

I hope all is displayed correctly, but it should be easy to reproduce.

-Helmut


[-- Attachment #2: cmduml.txt --]
[-- Type: application/octet-stream, Size: 24 bytes --]

[-- Attachment #3: Type: text/plain, Size: 218 bytes --]

--
Problem reports:       http://cygwin.com/problems.html
FAQ:                   http://cygwin.com/faq/
Documentation:         http://cygwin.com/docs.html
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple

^ permalink raw reply	[flat|nested] only message in thread

only message in thread, other threads:[~2015-10-06 19:01 UTC | newest]

Thread overview: (only message) (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
     [not found] <20151006210122.160D1A28.helmut.karlowski@ish.de>
2015-10-06 19:01 ` utf/codepage-conversion on cmd-window Helmut Karlowski

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).