public inbox for libc-alpha@sourceware.org
 help / color / mirror / Atom feed
* [PATCH] New locale: Yakut (Sakha) for Russia (sah_RU) [BZ #22241]
@ 2018-07-16 19:21 Rafal Luzynski
  2018-07-18  0:24 ` Carlos O'Donell
  0 siblings, 1 reply; 3+ messages in thread
From: Rafal Luzynski @ 2018-07-16 19:21 UTC (permalink / raw)
  To: libc-alpha

[-- Attachment #1: Type: text/plain, Size: 229 bytes --]

This is approved by Mike [1] but as we are in a slushy freeze period
I'd like to have more approvals before pushing.  OK for 2.28?  Carlos?
Anybody?

Regards,

Rafal


[1] https://sourceware.org/bugzilla/show_bug.cgi?id=22241#c17

[-- Attachment #2: 0001-New-locale-Yakut-Sakha-for-Russia-sah_RU-BZ-22241.patch --]
[-- Type: text/x-patch, Size: 13300 bytes --]

From 6d533c1e72f76dd5b602948f57f9d32cec05c0fd Mon Sep 17 00:00:00 2001
From: Valery Timiriliyev <timiriliyev@gmail.com>
Date: Fri, 6 Jul 2018 14:46:13 +0900
Subject: [PATCH] New locale: Yakut (Sakha) for Russia (sah_RU) [BZ #22241]

	* localedata/Makefile (test-input): Add sah_RU.UTF-8.
	(LOCALES): Likewise.
	* localedata/SUPPORTED (sah_RU/UTF-8): New entry.
	* localedata/locales/sah_RU: New file.
	* localedata/sah_RU.UTF-8.in: New file.
---
 NEWS                       |   4 +-
 localedata/Makefile        |   2 +
 localedata/SUPPORTED       |   1 +
 localedata/locales/sah_RU  | 291 +++++++++++++++++++++++++++++++++++++++++++++
 localedata/sah_RU.UTF-8.in |  84 +++++++++++++
 5 files changed, 380 insertions(+), 2 deletions(-)
 create mode 100644 localedata/locales/sah_RU
 create mode 100644 localedata/sah_RU.UTF-8.in

diff --git a/NEWS b/NEWS
index 5de2c28..325158a 100644
--- a/NEWS
+++ b/NEWS
@@ -40,8 +40,8 @@ Major new features:
   Upper Sorbian, and Walloon.  The following languages now support
   abbreviated alternative month names: Catalan, Greek, and Kashubian.
 
-* New locale added: Lower Sorbian (dsb_DE), including the support of the
-  nominative and genitive month names.
+* Newly added locales: Lower Sorbian (dsb_DE) and Yakut (sah_RU) also
+  include the support of the nominative and genitive month names.
 
 * Building and running on GNU/Hurd systems now works without out-of-tree
   patches.
diff --git a/localedata/Makefile b/localedata/Makefile
index 5ac4f74..43d2d37 100644
--- a/localedata/Makefile
+++ b/localedata/Makefile
@@ -95,6 +95,7 @@ test-input := \
 	ps_AF.UTF-8 \
 	ro_RO.UTF-8 \
 	ru_RU.UTF-8 \
+	sah_RU.UTF-8 \
 	sc_IT.UTF-8 \
 	se_NO.UTF-8 \
 	si_LK.UTF-8 \
@@ -250,6 +251,7 @@ LOCALES := \
 	ps_AF.UTF-8 \
 	ro_RO.UTF-8 \
 	ru_RU.UTF-8 \
+	sah_RU.UTF-8 \
 	sc_IT.UTF-8 \
 	se_NO.UTF-8 \
 	si_LK.UTF-8 \
diff --git a/localedata/SUPPORTED b/localedata/SUPPORTED
index 74aa15d..05de4e3 100644
--- a/localedata/SUPPORTED
+++ b/localedata/SUPPORTED
@@ -384,6 +384,7 @@ ru_UA.UTF-8/UTF-8 \
 ru_UA/KOI8-U \
 rw_RW/UTF-8 \
 sa_IN/UTF-8 \
+sah_RU/UTF-8 \
 sat_IN/UTF-8 \
 sc_IT/UTF-8 \
 sd_IN/UTF-8 \
diff --git a/localedata/locales/sah_RU b/localedata/locales/sah_RU
new file mode 100644
index 0000000..65af029
--- /dev/null
+++ b/localedata/locales/sah_RU
@@ -0,0 +1,291 @@
+escape_char  /
+comment_char  %
+
+% Yakut (Sakha) locale for Russian Federation
+% Source: Valery Timiriliyev
+% Email: timiriliyev@gmail.com
+% Tel:
+% Fax:
+% Language: sah
+% Territory: RU
+% Revision: 1.1.0
+% Date: 2018-07-06
+% Users: general
+%
+
+LC_IDENTIFICATION
+title      "Yakut (Sakha) locale for Russian Federation"
+source     "Valery Timiriliyev"
+address    ""
+contact    "Valery Timiriliyev"
+email      "timiriliyev@gmail.com"
+tel        ""
+fax        ""
+language   "Yakut"
+territory  "Russian Federation"
+revision   "1.1.0"
+date       "2018-07-06"
+%
+category  "i18n:2012";LC_IDENTIFICATION
+category  "i18n:2012";LC_CTYPE
+category  "i18n:2012";LC_COLLATE
+category  "i18n:2012";LC_TIME
+category  "i18n:2012";LC_NUMERIC
+category  "i18n:2012";LC_MONETARY
+category  "i18n:2012";LC_MESSAGES
+category  "i18n:2012";LC_PAPER
+category  "i18n:2012";LC_MEASUREMENT
+category  "i18n:2012";LC_NAME
+category  "i18n:2012";LC_ADDRESS
+category  "i18n:2012";LC_TELEPHONE
+END LC_IDENTIFICATION
+
+LC_CTYPE
+copy "ru_RU"
+END LC_CTYPE
+
+LC_COLLATE
+copy "iso14651_t1"
+
+% As of July 2018, the Yakut collating rules are not provided by CLDR.
+% This content is based on the alphabet from Wikipedia:
+% https://en.wikipedia.org/wiki/Yakut_language#Writing_system
+%
+% А Б В Г Ҕ Д Дь Е Ё Ж З И Й К Л М Н Ҥ Нь О Ө П Р С Һ Т У Ү Ф Х Ц Ч Ш Щ
+% Ъ Ы Ь Э Ю Я
+
+collating-element <d-soft-sign> from "<U0434><U044C>"
+collating-element <d-Soft-sign> from "<U0434><U042C>"
+collating-element <D-soft-sign> from "<U0414><U044C>"
+collating-element <D-Soft-sign> from "<U0414><U042C>"
+collating-element <n-soft-sign> from "<U043D><U044C>"
+collating-element <n-Soft-sign> from "<U043D><U042C>"
+collating-element <N-soft-sign> from "<U041D><U044C>"
+collating-element <N-Soft-sign> from "<U041D><U042C>"
+
+collating-symbol <yak-ghe>
+collating-symbol <yak-eng>
+collating-symbol <yak-oe>
+collating-symbol <yak-he>
+collating-symbol <yak-ue>
+
+collating-symbol <d-soft-sign-digraph>
+collating-symbol <n-soft-sign-digraph>
+
+% Ҕ after Г and after Ӷ
+reorder-after <S04F7> % CYRILLIC SMALL LETTER GHE WITH DESCENDER
+<yak-ghe>
+
+% Дь after Д and after Ԃ
+reorder-after <S0503> % CYRILLIC SMALL LETTER KOMI DJE
+<d-soft-sign-digraph>
+
+% Ҥ after Н and after Ԋ
+reorder-after <S050B> % CYRILLIC SMALL LETTER KOMI NJE
+<yak-eng>
+% Followed by Нь
+<n-soft-sign-digraph>
+
+% Ө after О
+% This is already default in iso14651_t1_common, no reorder needed.
+
+% Һ after С and after Ҫ
+reorder-after <S04AB> % CYRILLIC SMALL LETTER ES WITH DESCENDER
+<yak-he>
+
+% Ү after У and after Ӯ but before Ұ, Ꙋ, and Ѹ
+% This is already default in iso14651_t1_common, better leave unchanged.
+
+<U0495> <yak-ghe>;<BASE>;<MIN>;IGNORE % ҕ
+<U0494> <yak-ghe>;<BASE>;<CAP>;IGNORE % Ҕ
+<U04A5> <yak-eng>;<BASE>;<MIN>;IGNORE % ҥ
+<U04A4> <yak-eng>;<BASE>;<CAP>;IGNORE % Ҥ
+<U04BB> <yak-he>;<BASE>;<MIN>;IGNORE % һ
+<U04BA> <yak-he>;<BASE>;<CAP>;IGNORE % Һ
+
+<d-soft-sign> <d-soft-sign-digraph>;<BASE>;"<MIN><MIN>";IGNORE % дь
+<d-Soft-sign> <d-soft-sign-digraph>;<BASE>;"<MIN><CAP>";IGNORE % дЬ
+<D-soft-sign> <d-soft-sign-digraph>;<BASE>;"<CAP><MIN>";IGNORE % Дь
+<D-Soft-sign> <d-soft-sign-digraph>;<BASE>;"<CAP><CAP>";IGNORE % ДЬ
+<n-soft-sign> <n-soft-sign-digraph>;<BASE>;"<MIN><MIN>";IGNORE % нь
+<n-Soft-sign> <n-soft-sign-digraph>;<BASE>;"<MIN><CAP>";IGNORE % нЬ
+<N-soft-sign> <n-soft-sign-digraph>;<BASE>;"<CAP><MIN>";IGNORE % Нь
+<N-Soft-sign> <n-soft-sign-digraph>;<BASE>;"<CAP><CAP>";IGNORE % НЬ
+
+reorder-end
+
+END LC_COLLATE
+
+LC_MONETARY
+copy "ru_RU"
+END LC_MONETARY
+
+LC_NUMERIC
+copy "ru_RU"
+END LC_NUMERIC
+
+LC_TIME
+% abday - The abbreviations for the week days:
+% - бс, бн, оп, ср, чп, бт, сб
+abday       "<U0431><U0441>";"<U0431><U043D>";/
+            "<U043E><U043F>";"<U0441><U0440>";/
+            "<U0447><U043F>";"<U0431><U0442>";/
+            "<U0441><U0431>"
+
+% day - The full names of the week days:
+% - баскыһыанньа, бэнидиэнньик, оптуорунньук,
+%   сэрэдэ, чэппиэр, бээтинсэ, субуота
+day         "<U0431><U0430><U0441><U043A><U044B><U04BB><U044B><U0430><U043D><U043D><U044C><U0430>";/
+            "<U0431><U044D><U043D><U0438><U0434><U0438><U044D><U043D><U043D><U044C><U0438><U043A>";/
+            "<U043E><U043F><U0442><U0443><U043E><U0440><U0443><U043D><U043D><U044C><U0443><U043A>";/
+            "<U0441><U044D><U0440><U044D><U0434><U044D>";/
+            "<U0447><U044D><U043F><U043F><U0438><U044D><U0440>";/
+            "<U0431><U044D><U044D><U0442><U0438><U043D><U0441><U044D>";/
+            "<U0441><U0443><U0431><U0443><U043E><U0442><U0430>"
+
+% abmon - The abbreviations for the months
+% - тохс, олун, кул, муус, ыам, бэс, от, атыр, бал, алт, сэт, ахс
+abmon       "<U0442><U043E><U0445><U0441>";"<U043E><U043B><U0443><U043D>";/
+            "<U043A><U0443><U043B>";"<U043C><U0443><U0443><U0441>";/
+            "<U044B><U0430><U043C>";"<U0431><U044D><U0441>";/
+            "<U043E><U0442>";"<U0430><U0442><U044B><U0440>";/
+            "<U0431><U0430><U043B>";"<U0430><U043B><U0442>";/
+            "<U0441><U044D><U0442>";"<U0430><U0445><U0441>"
+
+% mon - The full names of the months (genitive case) -
+% - тохсунньу, олунньу, кулун тутар, муус устар, ыам ыйын,
+%   бэс ыйын, от ыйын, атырдьах ыйын, балаҕан ыйын, алтынньы,
+%   сэтинньи, ахсынньы
+mon         "<U0442><U043E><U0445><U0441><U0443><U043D><U043D><U044C><U0443>";/
+            "<U043E><U043B><U0443><U043D><U043D><U044C><U0443>";/
+            "<U043A><U0443><U043B><U0443><U043D> <U0442><U0443><U0442><U0430><U0440>";/
+            "<U043C><U0443><U0443><U0441> <U0443><U0441><U0442><U0430><U0440>";/
+            "<U044B><U0430><U043C> <U044B><U0439><U044B><U043D>";/
+            "<U0431><U044D><U0441> <U044B><U0439><U044B><U043D>";/
+            "<U043E><U0442> <U044B><U0439><U044B><U043D>";/
+            "<U0430><U0442><U044B><U0440><U0434><U044C><U0430><U0445> <U044B><U0439><U044B><U043D>";/
+            "<U0431><U0430><U043B><U0430><U0495><U0430><U043D> <U044B><U0439><U044B><U043D>";/
+            "<U0430><U043B><U0442><U044B><U043D><U043D><U044C><U044B>";/
+            "<U0441><U044D><U0442><U0438><U043D><U043D><U044C><U0438>";/
+            "<U0430><U0445><U0441><U044B><U043D><U043D><U044C><U044B>"
+
+% alt_mon - The full names of the months (nominative case) -
+% - тохсунньу, олунньу, кулун тутар, муус устар, ыам ыйа,
+%   бэс ыйа, от ыйа, атырдьах ыйа, балаҕан ыйа, алтынньы,
+%   сэтинньи, ахсынньы
+alt_mon     "<U0442><U043E><U0445><U0441><U0443><U043D><U043D><U044C><U0443>";/
+            "<U043E><U043B><U0443><U043D><U043D><U044C><U0443>";/
+            "<U043A><U0443><U043B><U0443><U043D> <U0442><U0443><U0442><U0430><U0440>";/
+            "<U043C><U0443><U0443><U0441> <U0443><U0441><U0442><U0430><U0440>";/
+            "<U044B><U0430><U043C> <U044B><U0439><U0430>";/
+            "<U0431><U044D><U0441> <U044B><U0439><U0430>";/
+            "<U043E><U0442> <U044B><U0439><U0430>";/
+            "<U0430><U0442><U044B><U0440><U0434><U044C><U0430><U0445> <U044B><U0439><U0430>";/
+            "<U0431><U0430><U043B><U0430><U0495><U0430><U043D> <U044B><U0439><U0430>";/
+            "<U0430><U043B><U0442><U044B><U043D><U043D><U044C><U044B>";/
+            "<U0441><U044D><U0442><U0438><U043D><U043D><U044C><U0438>";/
+            "<U0430><U0445><U0441><U044B><U043D><U043D><U044C><U044B>"
+
+% Abbreviated date and time representation to be referenced by the "%c" field
+% descriptor -
+%
+% "%a" (short weekday name),
+% "%Y" (year with century as a decimal number),
+% "%b" (short month name),
+% "%d" (day of month as a decimal number),
+% "%T" (24-hour clock time in format HH:MM:SS),
+% "%Z" (Time zone name)
+d_t_fmt "%a %Y %b %d %T (%Z)"
+
+% Date representation to be referenced by the "%x" field descriptor -
+% "%d/%m/%Y", day/month/year as decimal numbers (01/01/2000).
+d_fmt       "%Y.%m.%d"
+
+% Time representation to be referenced by the "%X" field descriptor -
+% "%T" (24-hour clock time in format HH:MM:SS)
+t_fmt       "%T"
+
+% Define representation of ante meridiem and post meridiem strings -
+% The "" mean default to "AM" and "PM".
+am_pm       "";""
+
+% Define time representation in 12-hour format with "am_pm", to be referenced by the "%r"
+% The "" means that this format is not supported.
+t_fmt_ampm  ""
+
+% Date representation not described in ISO/IEC 14652. Comes out as -
+% "%a %b %e %H:%M:%S %Z %Y" which is default "date" command output
+%
+% %a - abbreviated weekday name,
+% %Y - year with century as a decimal number,e.g. 2001.
+% %e - day of month as a decimal number with leading space (1 to 31),
+% %B - full month name,
+% %T - (24-hour clock time in format HH:MM:SS),
+% %Z - time-zone name,
+% "%a %Y с. %B %e к. %T (%Z)"
+date_fmt "%a %Y <U0441>. %B %e <U043A>. %T (%Z)"
+
+week    7;19971130;1
+first_weekday 2
+first_workday 2
+END LC_TIME
+
+LC_MESSAGES
+% The affirmative response -
+% '^[+1yYдДэЭ]'
+yesexpr "^[+1yY<U0434><U0414><U044D><U042D>]"
+
+% The negative response -
+% '^[-0nNнНсС]'
+noexpr "^[-0nN<U043D><U041D><U0441><U0421>]"
+
+% 'ээх'
+yesstr "<U044D><U044D><U0445>"
+
+% 'суох'
+nostr "<U0441><U0443><U043E><U0445>"
+END LC_MESSAGES
+
+LC_PAPER
+copy "ru_RU"
+END LC_PAPER
+
+LC_TELEPHONE
+copy "ru_RU"
+END LC_TELEPHONE
+
+LC_MEASUREMENT
+copy "ru_RU"
+END LC_MEASUREMENT
+
+LC_NAME
+copy "ru_RU"
+END LC_NAME
+
+LC_ADDRESS
+postal_fmt    "%f%N%a%N%d%N%b%N%s %h %e %r%N%z %T%N%c%N"
+
+% Россия
+country_name  "<U0420><U043E><U0441><U0441><U0438><U044F>"
+
+% Саха тыла
+lang_name     "<U0421><U0430><U0445><U0430> <U0442><U044B><U043B><U0430>"
+
+% UN Geneve 1949:68 Distinguishing signs of vehicles in international traffic
+% RUS
+country_car    "RUS"
+
+% ISO 639 language abbreviations:
+% 639-1 2 letter, 639-2 3 letter terminology
+% (empty), sah, sah
+lang_ab       ""
+lang_term     "sah"
+lang_lib      "sah"
+
+% ISO 3166 country number and 2 and 3 letter abreviations
+% RU, RUS
+country_ab2   "RU"
+country_ab3   "RUS"
+country_num   643
+
+END LC_ADDRESS
diff --git a/localedata/sah_RU.UTF-8.in b/localedata/sah_RU.UTF-8.in
new file mode 100644
index 0000000..324aaf9
--- /dev/null
+++ b/localedata/sah_RU.UTF-8.in
@@ -0,0 +1,84 @@
+а
+А
+б
+Б
+в
+В
+г
+Г
+ҕ
+Ҕ
+д
+Д
+дэ
+дя
+дь
+Дь
+е
+Е
+ё
+Ё
+ж
+Ж
+з
+З
+и
+И
+й
+Й
+к
+К
+л
+Л
+м
+М
+н
+Н
+нэ
+ня
+ҥ
+Ҥ
+нь
+Нь
+о
+О
+ө
+Ө
+п
+П
+р
+Р
+с
+С
+һ
+Һ
+т
+Т
+у
+У
+ү
+Ү
+ф
+Ф
+х
+Х
+ц
+Ц
+ч
+Ч
+ш
+Ш
+щ
+Щ
+ъ
+Ъ
+ы
+Ы
+ь
+Ь
+э
+Э
+ю
+Ю
+я
+Я
-- 
2.7.5


^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: [PATCH] New locale: Yakut (Sakha) for Russia (sah_RU) [BZ #22241]
  2018-07-16 19:21 [PATCH] New locale: Yakut (Sakha) for Russia (sah_RU) [BZ #22241] Rafal Luzynski
@ 2018-07-18  0:24 ` Carlos O'Donell
  2018-07-18  9:52   ` Rafal Luzynski
  0 siblings, 1 reply; 3+ messages in thread
From: Carlos O'Donell @ 2018-07-18  0:24 UTC (permalink / raw)
  To: Rafal Luzynski, libc-alpha

On 07/16/2018 03:20 PM, Rafal Luzynski wrote:
> This is approved by Mike [1] but as we are in a slushy freeze period
> I'd like to have more approvals before pushing.  OK for 2.28?  Carlos?
> Anybody?
> 
> Regards,
> 
> Rafal
> 
> 
> [1] https://sourceware.org/bugzilla/show_bug.cgi?id=22241#c17
> 

OK for 2.28. Please commit.

Adds a new collation test. And approval from Mike is good for me.

Reviewed-by: Carlos O'Donell <carlos@redhat.com>

-- 
Cheers,
Carlos.

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: [PATCH] New locale: Yakut (Sakha) for Russia (sah_RU) [BZ #22241]
  2018-07-18  0:24 ` Carlos O'Donell
@ 2018-07-18  9:52   ` Rafal Luzynski
  0 siblings, 0 replies; 3+ messages in thread
From: Rafal Luzynski @ 2018-07-18  9:52 UTC (permalink / raw)
  To: libc-alpha, Carlos O'Donell

18.07.2018 02:24 Carlos O'Donell <carlos@redhat.com> wrote:
>
> On 07/16/2018 03:20 PM, Rafal Luzynski wrote:
> > This is approved by Mike [1] but as we are in a slushy freeze period
> > I'd like to have more approvals before pushing. OK for 2.28? Carlos?
> > Anybody?
> >
> > Regards,
> >
> > Rafal
> >
> >
> > [1] https://sourceware.org/bugzilla/show_bug.cgi?id=22241#c17
> >
>
> OK for 2.28. Please commit.

Thank you.  This patch has been committed now.

> Adds a new collation test. And approval from Mike is good for me.

Yes, I had some problems with collation in sah_RU and having similar
problems with dsb_DE recently I tested and consulted this with Mike
multiple times.

Regards,

Rafal

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2018-07-18  9:52 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2018-07-16 19:21 [PATCH] New locale: Yakut (Sakha) for Russia (sah_RU) [BZ #22241] Rafal Luzynski
2018-07-18  0:24 ` Carlos O'Donell
2018-07-18  9:52   ` Rafal Luzynski

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).