public inbox for cygwin@cygwin.com
 help / color / mirror / Atom feed
* UTF-8 compatibility between Windows and Cygwin
@ 2017-05-25 15:36 Nellis, Kenneth
  2017-05-25 17:34 ` Andrey Repin
  0 siblings, 1 reply; 2+ messages in thread
From: Nellis, Kenneth @ 2017-05-25 15:36 UTC (permalink / raw)
  To: cygwin

I have (BOM-less) UTF-8 text files that I can read fine in 
Cygwin, but not Windows. When I create text files in Windows 
containing non-ASCII characters, I cannot read them in 
Cygwin. I understand why, but wondering the best way to be 
able to share text files across the two environments. 

I'm pretty sure that I want to keep my Cygwin LANG=C.UTF-8 
setting, but wondering what I can do on the Windows side for 
compatibility and what are the side effects. Currently 
Windows's Command Prompt command chcp shows "Active code 
page: 437". (Is that obsolete or even relevant?)

One solution seems to be to put a BOM in each UTF-8 text 
file; then the files read fine in both environments, but 
that's not conveniently accomplished.

--Ken Nellis

--
Problem reports:       http://cygwin.com/problems.html
FAQ:                   http://cygwin.com/faq/
Documentation:         http://cygwin.com/docs.html
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple

^ permalink raw reply	[flat|nested] 2+ messages in thread

* Re: UTF-8 compatibility between Windows and Cygwin
  2017-05-25 15:36 UTF-8 compatibility between Windows and Cygwin Nellis, Kenneth
@ 2017-05-25 17:34 ` Andrey Repin
  0 siblings, 0 replies; 2+ messages in thread
From: Andrey Repin @ 2017-05-25 17:34 UTC (permalink / raw)
  To: Nellis, Kenneth, cygwin

Greetings, Nellis, Kenneth!

> I have (BOM-less) UTF-8 text files that I can read fine in 
> Cygwin, but not Windows. When I create text files in Windows 
> containing non-ASCII characters, I cannot read them in 
> Cygwin. I understand why, but wondering the best way to be 
> able to share text files across the two environments. 

Please provide `od -t x1z` of the file you are referring to.

> I'm pretty sure that I want to keep my Cygwin LANG=C.UTF-8 
> setting,

Better use real language indication, and configure LC_ according to
preferences.

> but wondering what I can do on the Windows side for
> compatibility and what are the side effects.

On Windows side, you could actually use UTF-8.
Without seeing the requested dump, I can only guess, though.

> Currently
> Windows's Command Prompt command chcp shows "Active code 
> page: 437". (Is that obsolete or even relevant?)

Neither of the two, unless you consider relevance to the correct display of
the results and other console-related operations.
If you're often find yourself in a native console, I suggest you tweak LANG to
match (i.e. LANG=ru_RU.CP866 for me) and only change it to .UTF-8 for MinTTY
(it can help you with override, if you don't want to tweak your startup files).

> One solution seems to be to put a BOM in each UTF-8 text 
> file; then the files read fine in both environments, but 
> that's not conveniently accomplished.

That's hardly a solution, though. More like a workaround.


-- 
With best regards,
Andrey Repin
Thursday, May 25, 2017 18:48:43

Sorry for my terrible english...


--
Problem reports:       http://cygwin.com/problems.html
FAQ:                   http://cygwin.com/faq/
Documentation:         http://cygwin.com/docs.html
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple

^ permalink raw reply	[flat|nested] 2+ messages in thread

end of thread, other threads:[~2017-05-25 16:05 UTC | newest]

Thread overview: 2+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2017-05-25 15:36 UTF-8 compatibility between Windows and Cygwin Nellis, Kenneth
2017-05-25 17:34 ` Andrey Repin

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).