public inbox for libc-locales@sourceware.org
 help / color / mirror / Atom feed
* [PATCH] Fix Charset comment in fi_FI, fi_FI@euro
@ 2013-11-26 17:06 Marko Myllynen
  2013-11-26 17:26 ` keld
  0 siblings, 1 reply; 8+ messages in thread
From: Marko Myllynen @ 2013-11-26 17:06 UTC (permalink / raw)
  To: GNU C Library, libc-locales

Hi,

please see the patch below to fix the Charset comment in fi_FI and
fi_FI@euro. ISO-8859-1 is certainly incorrect with Euro and since SFS-EN
13710 (which we will hopefully support later) specifies a
non-ISO-8859-15 character (ANGSTROM SIGN) it seems best to use UTF-8.

2013-11-26  Marko Myllynen  <myllynen@redhat.com>

	* locales/fi_FI: Fix Charset comment.
	* locales/fi_FI@euro: Likewise.

diff --git a/localedata/locales/fi_FI b/localedata/locales/fi_FI
index f779964..6ea9301 100644
--- a/localedata/locales/fi_FI
+++ b/localedata/locales/fi_FI
@@ -16,7 +16,7 @@ comment_char    %
 % Date: 1996-10-15
 % Application: general
 % Users: general
-% Charset: ISO-8859-1
+% Charset: UTF-8
 % Distribution and use is free, also
 % for commercial purposes.
 %
diff --git a/localedata/locales/fi_FI@euro b/localedata/locales/fi_FI@euro
index 76f78ee..ef7f8cd 100644
--- a/localedata/locales/fi_FI@euro
+++ b/localedata/locales/fi_FI@euro
@@ -8,7 +8,7 @@ comment_char    %
 % Date: 2000-08-20
 % Application: general
 % Users: general
-% Charset: ISO-8859-15
+% Charset: UTF-8
 % Distribution and use is free, also
 % for commercial purposes.


-- 
Marko Myllynen

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [PATCH] Fix Charset comment in fi_FI, fi_FI@euro
  2013-11-26 17:06 [PATCH] Fix Charset comment in fi_FI, fi_FI@euro Marko Myllynen
@ 2013-11-26 17:26 ` keld
  2013-11-26 17:40   ` Marko Myllynen
  0 siblings, 1 reply; 8+ messages in thread
From: keld @ 2013-11-26 17:26 UTC (permalink / raw)
  To: Marko Myllynen; +Cc: GNU C Library, libc-locales

It is not necessary to specify UTF-8, all locales support UTF-8.
The question is whether the locale can be used with iso-8859-15. 
More than one charset may be specified with the charset comment.
If you like you could specify both UTF-8 and iso-8859-15,
with utf-8 first, if that is the preference.

What is ANGSTROM used for? As an alternative to Ã… in the sorting?
And other places?

Best regards
keld

On Tue, Nov 26, 2013 at 07:05:55PM +0200, Marko Myllynen wrote:
> Hi,
> 
> please see the patch below to fix the Charset comment in fi_FI and
> fi_FI@euro. ISO-8859-1 is certainly incorrect with Euro and since SFS-EN
> 13710 (which we will hopefully support later) specifies a
> non-ISO-8859-15 character (ANGSTROM SIGN) it seems best to use UTF-8.
> 
> 2013-11-26  Marko Myllynen  <myllynen@redhat.com>
> 
> 	* locales/fi_FI: Fix Charset comment.
> 	* locales/fi_FI@euro: Likewise.
> 
> diff --git a/localedata/locales/fi_FI b/localedata/locales/fi_FI
> index f779964..6ea9301 100644
> --- a/localedata/locales/fi_FI
> +++ b/localedata/locales/fi_FI
> @@ -16,7 +16,7 @@ comment_char    %
>  % Date: 1996-10-15
>  % Application: general
>  % Users: general
> -% Charset: ISO-8859-1
> +% Charset: UTF-8
>  % Distribution and use is free, also
>  % for commercial purposes.
>  %
> diff --git a/localedata/locales/fi_FI@euro b/localedata/locales/fi_FI@euro
> index 76f78ee..ef7f8cd 100644
> --- a/localedata/locales/fi_FI@euro
> +++ b/localedata/locales/fi_FI@euro
> @@ -8,7 +8,7 @@ comment_char    %
>  % Date: 2000-08-20
>  % Application: general
>  % Users: general
> -% Charset: ISO-8859-15
> +% Charset: UTF-8
>  % Distribution and use is free, also
>  % for commercial purposes.
> 
> 
> -- 
> Marko Myllynen

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [PATCH] Fix Charset comment in fi_FI, fi_FI@euro
  2013-11-26 17:26 ` keld
@ 2013-11-26 17:40   ` Marko Myllynen
  2013-11-26 18:12     ` keld
  0 siblings, 1 reply; 8+ messages in thread
From: Marko Myllynen @ 2013-11-26 17:40 UTC (permalink / raw)
  To: keld; +Cc: GNU C Library, libc-locales

Hi,

ANGSTROM SIGN is specified in the standard to sort it alongside with å/Å.

There is also a recent proposal to extend the auxiliary character set
(that's the closest translation I came up with, it's
"lisämerkkivalikoima" in Finnish in the proposal) with characters that
might be needed when processing texts in the Finnish context and the
proposal contains several non-ISO-8859-15 characters, see
http://www.kotoistus.fi/kannanotot_html/avoimet/lisamerkisto-2013.

So I think it's better not to suggest using ISO-8859-15 any more.

Thanks,

On 2013-11-26 19:25, keld@keldix.com wrote:
> It is not necessary to specify UTF-8, all locales support UTF-8.
> The question is whether the locale can be used with iso-8859-15. 
> More than one charset may be specified with the charset comment.
> If you like you could specify both UTF-8 and iso-8859-15,
> with utf-8 first, if that is the preference.
> 
> What is ANGSTROM used for? As an alternative to Ã… in the sorting?
> And other places?
> 
> Best regards
> keld
> 
> On Tue, Nov 26, 2013 at 07:05:55PM +0200, Marko Myllynen wrote:
>> Hi,
>>
>> please see the patch below to fix the Charset comment in fi_FI and
>> fi_FI@euro. ISO-8859-1 is certainly incorrect with Euro and since SFS-EN
>> 13710 (which we will hopefully support later) specifies a
>> non-ISO-8859-15 character (ANGSTROM SIGN) it seems best to use UTF-8.
>>
>> 2013-11-26  Marko Myllynen  <myllynen@redhat.com>
>>
>> 	* locales/fi_FI: Fix Charset comment.
>> 	* locales/fi_FI@euro: Likewise.
>>
>> diff --git a/localedata/locales/fi_FI b/localedata/locales/fi_FI
>> index f779964..6ea9301 100644
>> --- a/localedata/locales/fi_FI
>> +++ b/localedata/locales/fi_FI
>> @@ -16,7 +16,7 @@ comment_char    %
>>  % Date: 1996-10-15
>>  % Application: general
>>  % Users: general
>> -% Charset: ISO-8859-1
>> +% Charset: UTF-8
>>  % Distribution and use is free, also
>>  % for commercial purposes.
>>  %
>> diff --git a/localedata/locales/fi_FI@euro b/localedata/locales/fi_FI@euro
>> index 76f78ee..ef7f8cd 100644
>> --- a/localedata/locales/fi_FI@euro
>> +++ b/localedata/locales/fi_FI@euro
>> @@ -8,7 +8,7 @@ comment_char    %
>>  % Date: 2000-08-20
>>  % Application: general
>>  % Users: general
>> -% Charset: ISO-8859-15
>> +% Charset: UTF-8
>>  % Distribution and use is free, also
>>  % for commercial purposes.
>>
>>
>> -- 
>> Marko Myllynen


-- 
Marko Myllynen

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [PATCH] Fix Charset comment in fi_FI, fi_FI@euro
  2013-11-26 17:40   ` Marko Myllynen
@ 2013-11-26 18:12     ` keld
  2013-11-26 18:24       ` Marko Myllynen
  0 siblings, 1 reply; 8+ messages in thread
From: keld @ 2013-11-26 18:12 UTC (permalink / raw)
  To: Marko Myllynen; +Cc: GNU C Library, libc-locales

Hi again

Well, libc locales are able to sort the full ucs charset, so that is not an argument to
exclude iso-8859-15. Finland uses Euro, so that excludes iso-8859-1.
I do not think we have specs that specifies use of citation marks etc, which could be
characters outside 8859-15 for Finnish.

The Charset comment is meant to give charsets for which a locale would compile
with localedef without fatal errors. It can be used to generate
all the locale/charset combinations that a locale is valid in.

Best regards
keld

On Tue, Nov 26, 2013 at 07:38:38PM +0200, Marko Myllynen wrote:
> Hi,
> 
> ANGSTROM SIGN is specified in the standard to sort it alongside with å/Å.
> 
> There is also a recent proposal to extend the auxiliary character set
> (that's the closest translation I came up with, it's
> "lisämerkkivalikoima" in Finnish in the proposal) with characters that
> might be needed when processing texts in the Finnish context and the
> proposal contains several non-ISO-8859-15 characters, see
> http://www.kotoistus.fi/kannanotot_html/avoimet/lisamerkisto-2013.
> 
> So I think it's better not to suggest using ISO-8859-15 any more.
> 
> Thanks,
> 
> On 2013-11-26 19:25, keld@keldix.com wrote:
> > It is not necessary to specify UTF-8, all locales support UTF-8.
> > The question is whether the locale can be used with iso-8859-15. 
> > More than one charset may be specified with the charset comment.
> > If you like you could specify both UTF-8 and iso-8859-15,
> > with utf-8 first, if that is the preference.
> > 
> > What is ANGSTROM used for? As an alternative to Ã… in the sorting?
> > And other places?
> > 
> > Best regards
> > keld
> > 
> > On Tue, Nov 26, 2013 at 07:05:55PM +0200, Marko Myllynen wrote:
> >> Hi,
> >>
> >> please see the patch below to fix the Charset comment in fi_FI and
> >> fi_FI@euro. ISO-8859-1 is certainly incorrect with Euro and since SFS-EN
> >> 13710 (which we will hopefully support later) specifies a
> >> non-ISO-8859-15 character (ANGSTROM SIGN) it seems best to use UTF-8.
> >>
> >> 2013-11-26  Marko Myllynen  <myllynen@redhat.com>
> >>
> >> 	* locales/fi_FI: Fix Charset comment.
> >> 	* locales/fi_FI@euro: Likewise.
> >>
> >> diff --git a/localedata/locales/fi_FI b/localedata/locales/fi_FI
> >> index f779964..6ea9301 100644
> >> --- a/localedata/locales/fi_FI
> >> +++ b/localedata/locales/fi_FI
> >> @@ -16,7 +16,7 @@ comment_char    %
> >>  % Date: 1996-10-15
> >>  % Application: general
> >>  % Users: general
> >> -% Charset: ISO-8859-1
> >> +% Charset: UTF-8
> >>  % Distribution and use is free, also
> >>  % for commercial purposes.
> >>  %
> >> diff --git a/localedata/locales/fi_FI@euro b/localedata/locales/fi_FI@euro
> >> index 76f78ee..ef7f8cd 100644
> >> --- a/localedata/locales/fi_FI@euro
> >> +++ b/localedata/locales/fi_FI@euro
> >> @@ -8,7 +8,7 @@ comment_char    %
> >>  % Date: 2000-08-20
> >>  % Application: general
> >>  % Users: general
> >> -% Charset: ISO-8859-15
> >> +% Charset: UTF-8
> >>  % Distribution and use is free, also
> >>  % for commercial purposes.
> >>
> >>
> >> -- 
> >> Marko Myllynen
> 
> 
> -- 
> Marko Myllynen

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [PATCH] Fix Charset comment in fi_FI, fi_FI@euro
  2013-11-26 18:12     ` keld
@ 2013-11-26 18:24       ` Marko Myllynen
  2013-11-26 21:57         ` keld
  0 siblings, 1 reply; 8+ messages in thread
From: Marko Myllynen @ 2013-11-26 18:24 UTC (permalink / raw)
  To: keld; +Cc: GNU C Library, libc-locales

Hi,

correct, no non-ISO-8859-15 characters are used in the locale itself
elsewhere.

Updated patch below, ok now? Thanks.

2013-11-26  Marko Myllynen  <myllynen@redhat.com>

	* locales/fi_FI: Fix Charset comment.
	* locales/fi_FI@euro: Likewise.

diff --git a/localedata/locales/fi_FI b/localedata/locales/fi_FI
index f779964..6ea9301 100644
--- a/localedata/locales/fi_FI
+++ b/localedata/locales/fi_FI
@@ -16,7 +16,7 @@ comment_char    %
 % Date: 1996-10-15
 % Application: general
 % Users: general
-% Charset: ISO-8859-1
+% Charset: UTF-8, ISO-8859-15
 % Distribution and use is free, also
 % for commercial purposes.
 %
diff --git a/localedata/locales/fi_FI@euro b/localedata/locales/fi_FI@euro
index 76f78ee..ef7f8cd 100644
--- a/localedata/locales/fi_FI@euro
+++ b/localedata/locales/fi_FI@euro
@@ -8,7 +8,7 @@ comment_char    %
 % Date: 2000-08-20
 % Application: general
 % Users: general
-% Charset: ISO-8859-15
+% Charset: UTF-8, ISO-8859-15
 % Distribution and use is free, also
 % for commercial purposes.



On 2013-11-26 20:11, keld@keldix.com wrote:
> Hi again
> 
> Well, libc locales are able to sort the full ucs charset, so that is not an argument to
> exclude iso-8859-15. Finland uses Euro, so that excludes iso-8859-1.
> I do not think we have specs that specifies use of citation marks etc, which could be
> characters outside 8859-15 for Finnish.
> 
> The Charset comment is meant to give charsets for which a locale would compile
> with localedef without fatal errors. It can be used to generate
> all the locale/charset combinations that a locale is valid in.
> 
> Best regards
> keld
> 
> On Tue, Nov 26, 2013 at 07:38:38PM +0200, Marko Myllynen wrote:
>> Hi,
>>
>> ANGSTROM SIGN is specified in the standard to sort it alongside with å/Å.
>>
>> There is also a recent proposal to extend the auxiliary character set
>> (that's the closest translation I came up with, it's
>> "lisämerkkivalikoima" in Finnish in the proposal) with characters that
>> might be needed when processing texts in the Finnish context and the
>> proposal contains several non-ISO-8859-15 characters, see
>> http://www.kotoistus.fi/kannanotot_html/avoimet/lisamerkisto-2013.
>>
>> So I think it's better not to suggest using ISO-8859-15 any more.
>>
>> Thanks,
>>
>> On 2013-11-26 19:25, keld@keldix.com wrote:
>>> It is not necessary to specify UTF-8, all locales support UTF-8.
>>> The question is whether the locale can be used with iso-8859-15. 
>>> More than one charset may be specified with the charset comment.
>>> If you like you could specify both UTF-8 and iso-8859-15,
>>> with utf-8 first, if that is the preference.
>>>
>>> What is ANGSTROM used for? As an alternative to Ã… in the sorting?
>>> And other places?
>>>
>>> Best regards
>>> keld
>>>
>>> On Tue, Nov 26, 2013 at 07:05:55PM +0200, Marko Myllynen wrote:
>>>> Hi,
>>>>
>>>> please see the patch below to fix the Charset comment in fi_FI and
>>>> fi_FI@euro. ISO-8859-1 is certainly incorrect with Euro and since SFS-EN
>>>> 13710 (which we will hopefully support later) specifies a
>>>> non-ISO-8859-15 character (ANGSTROM SIGN) it seems best to use UTF-8.
>>>>
>>>> 2013-11-26  Marko Myllynen  <myllynen@redhat.com>
>>>>
>>>> 	* locales/fi_FI: Fix Charset comment.
>>>> 	* locales/fi_FI@euro: Likewise.
>>>>
>>>> diff --git a/localedata/locales/fi_FI b/localedata/locales/fi_FI
>>>> index f779964..6ea9301 100644
>>>> --- a/localedata/locales/fi_FI
>>>> +++ b/localedata/locales/fi_FI
>>>> @@ -16,7 +16,7 @@ comment_char    %
>>>>  % Date: 1996-10-15
>>>>  % Application: general
>>>>  % Users: general
>>>> -% Charset: ISO-8859-1
>>>> +% Charset: UTF-8
>>>>  % Distribution and use is free, also
>>>>  % for commercial purposes.
>>>>  %
>>>> diff --git a/localedata/locales/fi_FI@euro b/localedata/locales/fi_FI@euro
>>>> index 76f78ee..ef7f8cd 100644
>>>> --- a/localedata/locales/fi_FI@euro
>>>> +++ b/localedata/locales/fi_FI@euro
>>>> @@ -8,7 +8,7 @@ comment_char    %
>>>>  % Date: 2000-08-20
>>>>  % Application: general
>>>>  % Users: general
>>>> -% Charset: ISO-8859-15
>>>> +% Charset: UTF-8
>>>>  % Distribution and use is free, also
>>>>  % for commercial purposes.
>>>>
>>>>
>>>> -- 
>>>> Marko Myllynen
>>
>>
>> -- 
>> Marko Myllynen


-- 
Marko Myllynen

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [PATCH] Fix Charset comment in fi_FI, fi_FI@euro
  2013-11-26 18:24       ` Marko Myllynen
@ 2013-11-26 21:57         ` keld
  2013-12-10 12:38           ` Marko Myllynen
  0 siblings, 1 reply; 8+ messages in thread
From: keld @ 2013-11-26 21:57 UTC (permalink / raw)
  To: Marko Myllynen; +Cc: GNU C Library, libc-locales

Hi Marko

The patch is OK with me

Kiitos, 
Keld

On Tue, Nov 26, 2013 at 08:23:31PM +0200, Marko Myllynen wrote:
> Hi,
> 
> correct, no non-ISO-8859-15 characters are used in the locale itself
> elsewhere.
> 
> Updated patch below, ok now? Thanks.
> 
> 2013-11-26  Marko Myllynen  <myllynen@redhat.com>
> 
> 	* locales/fi_FI: Fix Charset comment.
> 	* locales/fi_FI@euro: Likewise.
> 
> diff --git a/localedata/locales/fi_FI b/localedata/locales/fi_FI
> index f779964..6ea9301 100644
> --- a/localedata/locales/fi_FI
> +++ b/localedata/locales/fi_FI
> @@ -16,7 +16,7 @@ comment_char    %
>  % Date: 1996-10-15
>  % Application: general
>  % Users: general
> -% Charset: ISO-8859-1
> +% Charset: UTF-8, ISO-8859-15
>  % Distribution and use is free, also
>  % for commercial purposes.
>  %
> diff --git a/localedata/locales/fi_FI@euro b/localedata/locales/fi_FI@euro
> index 76f78ee..ef7f8cd 100644
> --- a/localedata/locales/fi_FI@euro
> +++ b/localedata/locales/fi_FI@euro
> @@ -8,7 +8,7 @@ comment_char    %
>  % Date: 2000-08-20
>  % Application: general
>  % Users: general
> -% Charset: ISO-8859-15
> +% Charset: UTF-8, ISO-8859-15
>  % Distribution and use is free, also
>  % for commercial purposes.
> 
> 
> 
> On 2013-11-26 20:11, keld@keldix.com wrote:
> > Hi again
> > 
> > Well, libc locales are able to sort the full ucs charset, so that is not an argument to
> > exclude iso-8859-15. Finland uses Euro, so that excludes iso-8859-1.
> > I do not think we have specs that specifies use of citation marks etc, which could be
> > characters outside 8859-15 for Finnish.
> > 
> > The Charset comment is meant to give charsets for which a locale would compile
> > with localedef without fatal errors. It can be used to generate
> > all the locale/charset combinations that a locale is valid in.
> > 
> > Best regards
> > keld
> > 
> > On Tue, Nov 26, 2013 at 07:38:38PM +0200, Marko Myllynen wrote:
> >> Hi,
> >>
> >> ANGSTROM SIGN is specified in the standard to sort it alongside with å/Å.
> >>
> >> There is also a recent proposal to extend the auxiliary character set
> >> (that's the closest translation I came up with, it's
> >> "lisämerkkivalikoima" in Finnish in the proposal) with characters that
> >> might be needed when processing texts in the Finnish context and the
> >> proposal contains several non-ISO-8859-15 characters, see
> >> http://www.kotoistus.fi/kannanotot_html/avoimet/lisamerkisto-2013.
> >>
> >> So I think it's better not to suggest using ISO-8859-15 any more.
> >>
> >> Thanks,
> >>
> >> On 2013-11-26 19:25, keld@keldix.com wrote:
> >>> It is not necessary to specify UTF-8, all locales support UTF-8.
> >>> The question is whether the locale can be used with iso-8859-15. 
> >>> More than one charset may be specified with the charset comment.
> >>> If you like you could specify both UTF-8 and iso-8859-15,
> >>> with utf-8 first, if that is the preference.
> >>>
> >>> What is ANGSTROM used for? As an alternative to Ã… in the sorting?
> >>> And other places?
> >>>
> >>> Best regards
> >>> keld
> >>>
> >>> On Tue, Nov 26, 2013 at 07:05:55PM +0200, Marko Myllynen wrote:
> >>>> Hi,
> >>>>
> >>>> please see the patch below to fix the Charset comment in fi_FI and
> >>>> fi_FI@euro. ISO-8859-1 is certainly incorrect with Euro and since SFS-EN
> >>>> 13710 (which we will hopefully support later) specifies a
> >>>> non-ISO-8859-15 character (ANGSTROM SIGN) it seems best to use UTF-8.
> >>>>
> >>>> 2013-11-26  Marko Myllynen  <myllynen@redhat.com>
> >>>>
> >>>> 	* locales/fi_FI: Fix Charset comment.
> >>>> 	* locales/fi_FI@euro: Likewise.
> >>>>
> >>>> diff --git a/localedata/locales/fi_FI b/localedata/locales/fi_FI
> >>>> index f779964..6ea9301 100644
> >>>> --- a/localedata/locales/fi_FI
> >>>> +++ b/localedata/locales/fi_FI
> >>>> @@ -16,7 +16,7 @@ comment_char    %
> >>>>  % Date: 1996-10-15
> >>>>  % Application: general
> >>>>  % Users: general
> >>>> -% Charset: ISO-8859-1
> >>>> +% Charset: UTF-8
> >>>>  % Distribution and use is free, also
> >>>>  % for commercial purposes.
> >>>>  %
> >>>> diff --git a/localedata/locales/fi_FI@euro b/localedata/locales/fi_FI@euro
> >>>> index 76f78ee..ef7f8cd 100644
> >>>> --- a/localedata/locales/fi_FI@euro
> >>>> +++ b/localedata/locales/fi_FI@euro
> >>>> @@ -8,7 +8,7 @@ comment_char    %
> >>>>  % Date: 2000-08-20
> >>>>  % Application: general
> >>>>  % Users: general
> >>>> -% Charset: ISO-8859-15
> >>>> +% Charset: UTF-8
> >>>>  % Distribution and use is free, also
> >>>>  % for commercial purposes.
> >>>>
> >>>>
> >>>> -- 
> >>>> Marko Myllynen
> >>
> >>
> >> -- 
> >> Marko Myllynen
> 
> 
> -- 
> Marko Myllynen

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [PATCH] Fix Charset comment in fi_FI, fi_FI@euro
  2013-11-26 21:57         ` keld
@ 2013-12-10 12:38           ` Marko Myllynen
  2013-12-12  4:01             ` Siddhesh Poyarekar
  0 siblings, 1 reply; 8+ messages in thread
From: Marko Myllynen @ 2013-12-10 12:38 UTC (permalink / raw)
  Cc: GNU C Library, libc-locales

Ping / could someone commit this one, please?

https://sourceware.org/ml/libc-alpha/2013-11/msg00787.html

Thanks,

On 2013-11-26 21:39, keld@keldix.com wrote:
> Hi Marko
> 
> The patch is OK with me
> 
> Kiitos, 
> Keld
> 
> On Tue, Nov 26, 2013 at 08:23:31PM +0200, Marko Myllynen wrote:
>> Hi,
>>
>> correct, no non-ISO-8859-15 characters are used in the locale itself
>> elsewhere.
>>
>> Updated patch below, ok now? Thanks.
>>
>> 2013-11-26  Marko Myllynen  <myllynen@redhat.com>
>>
>> 	* locales/fi_FI: Fix Charset comment.
>> 	* locales/fi_FI@euro: Likewise.
>>
>> diff --git a/localedata/locales/fi_FI b/localedata/locales/fi_FI
>> index f779964..6ea9301 100644
>> --- a/localedata/locales/fi_FI
>> +++ b/localedata/locales/fi_FI
>> @@ -16,7 +16,7 @@ comment_char    %
>>  % Date: 1996-10-15
>>  % Application: general
>>  % Users: general
>> -% Charset: ISO-8859-1
>> +% Charset: UTF-8, ISO-8859-15
>>  % Distribution and use is free, also
>>  % for commercial purposes.
>>  %
>> diff --git a/localedata/locales/fi_FI@euro b/localedata/locales/fi_FI@euro
>> index 76f78ee..ef7f8cd 100644
>> --- a/localedata/locales/fi_FI@euro
>> +++ b/localedata/locales/fi_FI@euro
>> @@ -8,7 +8,7 @@ comment_char    %
>>  % Date: 2000-08-20
>>  % Application: general
>>  % Users: general
>> -% Charset: ISO-8859-15
>> +% Charset: UTF-8, ISO-8859-15
>>  % Distribution and use is free, also
>>  % for commercial purposes.
>>
>>
>>
>> On 2013-11-26 20:11, keld@keldix.com wrote:
>>> Hi again
>>>
>>> Well, libc locales are able to sort the full ucs charset, so that is not an argument to
>>> exclude iso-8859-15. Finland uses Euro, so that excludes iso-8859-1.
>>> I do not think we have specs that specifies use of citation marks etc, which could be
>>> characters outside 8859-15 for Finnish.
>>>
>>> The Charset comment is meant to give charsets for which a locale would compile
>>> with localedef without fatal errors. It can be used to generate
>>> all the locale/charset combinations that a locale is valid in.
>>>
>>> Best regards
>>> keld
>>>
>>> On Tue, Nov 26, 2013 at 07:38:38PM +0200, Marko Myllynen wrote:
>>>> Hi,
>>>>
>>>> ANGSTROM SIGN is specified in the standard to sort it alongside with å/Å.
>>>>
>>>> There is also a recent proposal to extend the auxiliary character set
>>>> (that's the closest translation I came up with, it's
>>>> "lisämerkkivalikoima" in Finnish in the proposal) with characters that
>>>> might be needed when processing texts in the Finnish context and the
>>>> proposal contains several non-ISO-8859-15 characters, see
>>>> http://www.kotoistus.fi/kannanotot_html/avoimet/lisamerkisto-2013.
>>>>
>>>> So I think it's better not to suggest using ISO-8859-15 any more.
>>>>
>>>> Thanks,
>>>>
>>>> On 2013-11-26 19:25, keld@keldix.com wrote:
>>>>> It is not necessary to specify UTF-8, all locales support UTF-8.
>>>>> The question is whether the locale can be used with iso-8859-15. 
>>>>> More than one charset may be specified with the charset comment.
>>>>> If you like you could specify both UTF-8 and iso-8859-15,
>>>>> with utf-8 first, if that is the preference.
>>>>>
>>>>> What is ANGSTROM used for? As an alternative to Ã… in the sorting?
>>>>> And other places?
>>>>>
>>>>> Best regards
>>>>> keld
>>>>>
>>>>> On Tue, Nov 26, 2013 at 07:05:55PM +0200, Marko Myllynen wrote:
>>>>>> Hi,
>>>>>>
>>>>>> please see the patch below to fix the Charset comment in fi_FI and
>>>>>> fi_FI@euro. ISO-8859-1 is certainly incorrect with Euro and since SFS-EN
>>>>>> 13710 (which we will hopefully support later) specifies a
>>>>>> non-ISO-8859-15 character (ANGSTROM SIGN) it seems best to use UTF-8.
>>>>>>
>>>>>> 2013-11-26  Marko Myllynen  <myllynen@redhat.com>
>>>>>>
>>>>>> 	* locales/fi_FI: Fix Charset comment.
>>>>>> 	* locales/fi_FI@euro: Likewise.
>>>>>>
>>>>>> diff --git a/localedata/locales/fi_FI b/localedata/locales/fi_FI
>>>>>> index f779964..6ea9301 100644
>>>>>> --- a/localedata/locales/fi_FI
>>>>>> +++ b/localedata/locales/fi_FI
>>>>>> @@ -16,7 +16,7 @@ comment_char    %
>>>>>>  % Date: 1996-10-15
>>>>>>  % Application: general
>>>>>>  % Users: general
>>>>>> -% Charset: ISO-8859-1
>>>>>> +% Charset: UTF-8
>>>>>>  % Distribution and use is free, also
>>>>>>  % for commercial purposes.
>>>>>>  %
>>>>>> diff --git a/localedata/locales/fi_FI@euro b/localedata/locales/fi_FI@euro
>>>>>> index 76f78ee..ef7f8cd 100644
>>>>>> --- a/localedata/locales/fi_FI@euro
>>>>>> +++ b/localedata/locales/fi_FI@euro
>>>>>> @@ -8,7 +8,7 @@ comment_char    %
>>>>>>  % Date: 2000-08-20
>>>>>>  % Application: general
>>>>>>  % Users: general
>>>>>> -% Charset: ISO-8859-15
>>>>>> +% Charset: UTF-8
>>>>>>  % Distribution and use is free, also
>>>>>>  % for commercial purposes.
>>>>>>
>>>>>>
>>>>>> -- 
>>>>>> Marko Myllynen
>>>>
>>>>
>>>> -- 
>>>> Marko Myllynen
>>
>>
>> -- 
>> Marko Myllynen


-- 
Marko Myllynen

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [PATCH] Fix Charset comment in fi_FI, fi_FI@euro
  2013-12-10 12:38           ` Marko Myllynen
@ 2013-12-12  4:01             ` Siddhesh Poyarekar
  0 siblings, 0 replies; 8+ messages in thread
From: Siddhesh Poyarekar @ 2013-12-12  4:01 UTC (permalink / raw)
  To: Marko Myllynen; +Cc: GNU C Library, libc-locales

Thanks, I've pushed this now.

Siddhesh

On Tue, Dec 10, 2013 at 02:38:18PM +0200, Marko Myllynen wrote:
> Ping / could someone commit this one, please?
> 
> https://sourceware.org/ml/libc-alpha/2013-11/msg00787.html
> 
> Thanks,
> 
> On 2013-11-26 21:39, keld@keldix.com wrote:
> > Hi Marko
> > 
> > The patch is OK with me
> > 
> > Kiitos, 
> > Keld
> > 
> > On Tue, Nov 26, 2013 at 08:23:31PM +0200, Marko Myllynen wrote:
> >> Hi,
> >>
> >> correct, no non-ISO-8859-15 characters are used in the locale itself
> >> elsewhere.
> >>
> >> Updated patch below, ok now? Thanks.
> >>
> >> 2013-11-26  Marko Myllynen  <myllynen@redhat.com>
> >>
> >> 	* locales/fi_FI: Fix Charset comment.
> >> 	* locales/fi_FI@euro: Likewise.
> >>
> >> diff --git a/localedata/locales/fi_FI b/localedata/locales/fi_FI
> >> index f779964..6ea9301 100644
> >> --- a/localedata/locales/fi_FI
> >> +++ b/localedata/locales/fi_FI
> >> @@ -16,7 +16,7 @@ comment_char    %
> >>  % Date: 1996-10-15
> >>  % Application: general
> >>  % Users: general
> >> -% Charset: ISO-8859-1
> >> +% Charset: UTF-8, ISO-8859-15
> >>  % Distribution and use is free, also
> >>  % for commercial purposes.
> >>  %
> >> diff --git a/localedata/locales/fi_FI@euro b/localedata/locales/fi_FI@euro
> >> index 76f78ee..ef7f8cd 100644
> >> --- a/localedata/locales/fi_FI@euro
> >> +++ b/localedata/locales/fi_FI@euro
> >> @@ -8,7 +8,7 @@ comment_char    %
> >>  % Date: 2000-08-20
> >>  % Application: general
> >>  % Users: general
> >> -% Charset: ISO-8859-15
> >> +% Charset: UTF-8, ISO-8859-15
> >>  % Distribution and use is free, also
> >>  % for commercial purposes.
> >>
> >>
> >>
> >> On 2013-11-26 20:11, keld@keldix.com wrote:
> >>> Hi again
> >>>
> >>> Well, libc locales are able to sort the full ucs charset, so that is not an argument to
> >>> exclude iso-8859-15. Finland uses Euro, so that excludes iso-8859-1.
> >>> I do not think we have specs that specifies use of citation marks etc, which could be
> >>> characters outside 8859-15 for Finnish.
> >>>
> >>> The Charset comment is meant to give charsets for which a locale would compile
> >>> with localedef without fatal errors. It can be used to generate
> >>> all the locale/charset combinations that a locale is valid in.
> >>>
> >>> Best regards
> >>> keld
> >>>
> >>> On Tue, Nov 26, 2013 at 07:38:38PM +0200, Marko Myllynen wrote:
> >>>> Hi,
> >>>>
> >>>> ANGSTROM SIGN is specified in the standard to sort it alongside with å/Å.
> >>>>
> >>>> There is also a recent proposal to extend the auxiliary character set
> >>>> (that's the closest translation I came up with, it's
> >>>> "lisämerkkivalikoima" in Finnish in the proposal) with characters that
> >>>> might be needed when processing texts in the Finnish context and the
> >>>> proposal contains several non-ISO-8859-15 characters, see
> >>>> http://www.kotoistus.fi/kannanotot_html/avoimet/lisamerkisto-2013.
> >>>>
> >>>> So I think it's better not to suggest using ISO-8859-15 any more.
> >>>>
> >>>> Thanks,
> >>>>
> >>>> On 2013-11-26 19:25, keld@keldix.com wrote:
> >>>>> It is not necessary to specify UTF-8, all locales support UTF-8.
> >>>>> The question is whether the locale can be used with iso-8859-15. 
> >>>>> More than one charset may be specified with the charset comment.
> >>>>> If you like you could specify both UTF-8 and iso-8859-15,
> >>>>> with utf-8 first, if that is the preference.
> >>>>>
> >>>>> What is ANGSTROM used for? As an alternative to Ã… in the sorting?
> >>>>> And other places?
> >>>>>
> >>>>> Best regards
> >>>>> keld
> >>>>>
> >>>>> On Tue, Nov 26, 2013 at 07:05:55PM +0200, Marko Myllynen wrote:
> >>>>>> Hi,
> >>>>>>
> >>>>>> please see the patch below to fix the Charset comment in fi_FI and
> >>>>>> fi_FI@euro. ISO-8859-1 is certainly incorrect with Euro and since SFS-EN
> >>>>>> 13710 (which we will hopefully support later) specifies a
> >>>>>> non-ISO-8859-15 character (ANGSTROM SIGN) it seems best to use UTF-8.
> >>>>>>
> >>>>>> 2013-11-26  Marko Myllynen  <myllynen@redhat.com>
> >>>>>>
> >>>>>> 	* locales/fi_FI: Fix Charset comment.
> >>>>>> 	* locales/fi_FI@euro: Likewise.
> >>>>>>
> >>>>>> diff --git a/localedata/locales/fi_FI b/localedata/locales/fi_FI
> >>>>>> index f779964..6ea9301 100644
> >>>>>> --- a/localedata/locales/fi_FI
> >>>>>> +++ b/localedata/locales/fi_FI
> >>>>>> @@ -16,7 +16,7 @@ comment_char    %
> >>>>>>  % Date: 1996-10-15
> >>>>>>  % Application: general
> >>>>>>  % Users: general
> >>>>>> -% Charset: ISO-8859-1
> >>>>>> +% Charset: UTF-8
> >>>>>>  % Distribution and use is free, also
> >>>>>>  % for commercial purposes.
> >>>>>>  %
> >>>>>> diff --git a/localedata/locales/fi_FI@euro b/localedata/locales/fi_FI@euro
> >>>>>> index 76f78ee..ef7f8cd 100644
> >>>>>> --- a/localedata/locales/fi_FI@euro
> >>>>>> +++ b/localedata/locales/fi_FI@euro
> >>>>>> @@ -8,7 +8,7 @@ comment_char    %
> >>>>>>  % Date: 2000-08-20
> >>>>>>  % Application: general
> >>>>>>  % Users: general
> >>>>>> -% Charset: ISO-8859-15
> >>>>>> +% Charset: UTF-8
> >>>>>>  % Distribution and use is free, also
> >>>>>>  % for commercial purposes.
> >>>>>>
> >>>>>>
> >>>>>> -- 
> >>>>>> Marko Myllynen
> >>>>
> >>>>
> >>>> -- 
> >>>> Marko Myllynen
> >>
> >>
> >> -- 
> >> Marko Myllynen
> 
> 
> -- 
> Marko Myllynen

^ permalink raw reply	[flat|nested] 8+ messages in thread

end of thread, other threads:[~2013-12-12  4:01 UTC | newest]

Thread overview: 8+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2013-11-26 17:06 [PATCH] Fix Charset comment in fi_FI, fi_FI@euro Marko Myllynen
2013-11-26 17:26 ` keld
2013-11-26 17:40   ` Marko Myllynen
2013-11-26 18:12     ` keld
2013-11-26 18:24       ` Marko Myllynen
2013-11-26 21:57         ` keld
2013-12-10 12:38           ` Marko Myllynen
2013-12-12  4:01             ` Siddhesh Poyarekar

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).