From: Erwin Waterlander <waterlan@xs4all.nl>
To: cygwin@cygwin.com
Subject: Re: How does Cygwin handle non-Latin1 man pages? (move to UTF-8?)
Date: Wed, 25 Sep 2013 20:10:00 -0000 [thread overview]
Message-ID: <524341E5.6080601@xs4all.nl> (raw)
In-Reply-To: <52433E7A.4070600@xs4all.nl>
Erwin Waterlander schreef, Op 25-9-2013 21:50:
>
> Erwin Waterlander schreef, Op 24-9-2013 22:01:
>> Hi,
>>
>> As far as I see it, on Cygwin it is assumed that man pages are
>> encoded in Latin-1 (ISO-8859-1).
>> For instance the man pages of vim.
>>
>> /usr/share/man/fr/vim.1.gz is encoded in Latin-1.
>>
>> $ export LANG=fr_FR.UTF-8
>> $ man vim
>>
>> This will show the French man page correctly. Latin-1 is converted to
>> UTF-8.
>>
>> For the Russian translation of the vim manual I see two files:
>> /usr/share/man/ru.UTF-8/man1/vim.1.gz
>> /usr/share/man/ru.KOI8-R/man1/vim.1.gz
>>
>>
>> When I type
>> $ export LANG=ru_RU.UTF-8
>> $ man vim
>>
>> I get the English man page, instead of the Russian man page.
>> I think because there is no /usr/share/man/ru/man1/vim.1.gz present.
>>
>
> The problem is here that man looks for the manual in these directories
> in this order:
> /usr/share/man/ru_RU.UTF-8
> /usr/share/man/ru_RU
> /usr/share/man/ru
>
> All three paths are not present on Cygwin.
> I could set LANG to ru.UTF-8, but this is not common practice.
> Normally you set LANG to ru_RU.UTF-8. Therefore I think that the
> non-Latin1 folders under /usr/share/man have the wrong name.
> When I set LANG to ru.UTF-8, man finds the Russian man page, but
> displays it wrongly. Even when I fix the NROFF line in /etc/man.conf.
> Moving /usr/share/man/ru.UTF-8 to /usr/share/man/ru_RU.UTF-8 (and
> fixing man.conf) makes the man page display properly. This confirms
> that the non-latin1 directories have the wrong name in Cygwin.
>
>> When I type
>>
>> $ export LANG=ru_RU.UTF-8
>> $ export LANGUAGE=ru.UTF-8
>> $ man vim
>>
>> The Russian man page is displayed, but all Russian characters are
>> wrongly displayed.
>> I think because it is assumed the man page is in Latin-1.
>>
>> To get a correct display of the Russian man page I need to change
>> /etc/man.config
>> I change the line with NROFF to:
>> NROFF /usr/bin/preconv | /usr/bin/nroff -c -mandoc 2>/dev/null
>>
>> Now the Russian man page displays correctly, but now all the Latin-1
>> pages display wrongly.
>
> This can be fixed by adding a coding tag to the first or second line
> of the man page, which is understood by preconv.
> When I set LANG to fr_FR.UTF-8, move /usr/share/man/fr.UTF-8 to
> /usr/share/man/fr_FR.UTF-8, and add this tag to vim.1
>
> .\" -*- coding: latin-1; -*-
>
> The French manual displays properly.
Actually this is not working. Somewhere the coding tag is lost, although
preconv seems to do a good job. I reported this three years ago to the
maintainers of man an groff, but it appears it is still not fixed.
>
>
>>
>> So I undo my change in /etc/man.conf
>>
>>
>> On Linux the trend is to convert all man pages to UTF-8 encoding.
>> Will Cygwin follow this trend?
>>
>>
>
> The following needs to be done in Cygwin to have man pages for all
> scripts displayed properly out of the box (assuming an UTF-8 locale
> and use of mintty):
>
> * Rename the non-latin1 directories under /usr/share/man/ to
> fr_FR.UTF-8, ru_RU.UTF-8, and so on.
> * Change /etc/man.conf to use preconv:
> NROFF /usr/bin/preconv | /usr/bin/nroff -c -mandoc 2>/dev/null
> * Convert all Latin-1 coded man pages to UTF-8, or add a latin-1
> coding tag on the first line
Since the coding tag is not working it is best to convert all man pages
to UTF-8.
--
Erwin Waterlander
http://waterlan.home.xs4all.nl/
--
Problem reports: http://cygwin.com/problems.html
FAQ: http://cygwin.com/faq/
Documentation: http://cygwin.com/docs.html
Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple
next prev parent reply other threads:[~2013-09-25 20:04 UTC|newest]
Thread overview: 21+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-09-24 20:29 Erwin Waterlander
2013-09-25 19:59 ` Erwin Waterlander
2013-09-25 20:10 ` Erwin Waterlander [this message]
2014-03-14 12:58 ` Cygwin needs a man-db port (was: How does Cygwin handle non-Latin1 man pages? (move to UTF-8?)) Erwin Waterlander
2014-03-14 13:52 ` Cygwin needs a man-db port Chris J. Breisch
2014-03-14 14:38 ` Erwin Waterlander
2014-03-14 15:52 ` Chris J. Breisch
2014-03-14 16:29 ` Erwin Waterlander
2014-03-14 17:05 ` Corinna Vinschen
2014-03-14 19:22 ` Chris J. Breisch
2014-03-16 11:43 ` Chris J. Breisch
2014-04-17 18:32 ` Chris J. Breisch
2014-04-20 6:41 ` Erwin Waterlander
2014-04-21 20:41 ` waterlan
2014-04-22 3:04 ` KIMURA Masaru
2014-04-23 15:18 ` Chris J. Breisch
2014-04-24 18:30 ` waterlan
2014-04-25 0:47 ` Chris J. Breisch
2014-04-24 19:43 ` waterlan
2014-04-24 22:10 ` waterlan
2014-04-30 16:49 ` waterlan
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=524341E5.6080601@xs4all.nl \
--to=waterlan@xs4all.nl \
--cc=cygwin@cygwin.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).