* UTF-8 compatibility between Windows and Cygwin
@ 2017-05-25 15:36 Nellis, Kenneth
2017-05-25 17:34 ` Andrey Repin
0 siblings, 1 reply; 2+ messages in thread
From: Nellis, Kenneth @ 2017-05-25 15:36 UTC (permalink / raw)
To: cygwin
I have (BOM-less) UTF-8 text files that I can read fine in
Cygwin, but not Windows. When I create text files in Windows
containing non-ASCII characters, I cannot read them in
Cygwin. I understand why, but wondering the best way to be
able to share text files across the two environments.
I'm pretty sure that I want to keep my Cygwin LANG=C.UTF-8
setting, but wondering what I can do on the Windows side for
compatibility and what are the side effects. Currently
Windows's Command Prompt command chcp shows "Active code
page: 437". (Is that obsolete or even relevant?)
One solution seems to be to put a BOM in each UTF-8 text
file; then the files read fine in both environments, but
that's not conveniently accomplished.
--Ken Nellis
--
Problem reports: http://cygwin.com/problems.html
FAQ: http://cygwin.com/faq/
Documentation: http://cygwin.com/docs.html
Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple
^ permalink raw reply [flat|nested] 2+ messages in thread
* Re: UTF-8 compatibility between Windows and Cygwin
2017-05-25 15:36 UTF-8 compatibility between Windows and Cygwin Nellis, Kenneth
@ 2017-05-25 17:34 ` Andrey Repin
0 siblings, 0 replies; 2+ messages in thread
From: Andrey Repin @ 2017-05-25 17:34 UTC (permalink / raw)
To: Nellis, Kenneth, cygwin
Greetings, Nellis, Kenneth!
> I have (BOM-less) UTF-8 text files that I can read fine in
> Cygwin, but not Windows. When I create text files in Windows
> containing non-ASCII characters, I cannot read them in
> Cygwin. I understand why, but wondering the best way to be
> able to share text files across the two environments.
Please provide `od -t x1z` of the file you are referring to.
> I'm pretty sure that I want to keep my Cygwin LANG=C.UTF-8
> setting,
Better use real language indication, and configure LC_ according to
preferences.
> but wondering what I can do on the Windows side for
> compatibility and what are the side effects.
On Windows side, you could actually use UTF-8.
Without seeing the requested dump, I can only guess, though.
> Currently
> Windows's Command Prompt command chcp shows "Active code
> page: 437". (Is that obsolete or even relevant?)
Neither of the two, unless you consider relevance to the correct display of
the results and other console-related operations.
If you're often find yourself in a native console, I suggest you tweak LANG to
match (i.e. LANG=ru_RU.CP866 for me) and only change it to .UTF-8 for MinTTY
(it can help you with override, if you don't want to tweak your startup files).
> One solution seems to be to put a BOM in each UTF-8 text
> file; then the files read fine in both environments, but
> that's not conveniently accomplished.
That's hardly a solution, though. More like a workaround.
--
With best regards,
Andrey Repin
Thursday, May 25, 2017 18:48:43
Sorry for my terrible english...
--
Problem reports: http://cygwin.com/problems.html
FAQ: http://cygwin.com/faq/
Documentation: http://cygwin.com/docs.html
Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple
^ permalink raw reply [flat|nested] 2+ messages in thread
end of thread, other threads:[~2017-05-25 16:05 UTC | newest]
Thread overview: 2+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2017-05-25 15:36 UTF-8 compatibility between Windows and Cygwin Nellis, Kenneth
2017-05-25 17:34 ` Andrey Repin
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).