public inbox for glibc-bugs@sourceware.org
* [Bug localedata/4024] collation in pinyin for zh_CN locale
       [not found] <bug-4024-131@http.sourceware.org/bugzilla/>
@ 2014-02-16 18:29 ` jackie.rosen at hushmail dot com
  2014-05-28 19:44 ` schwab at sourceware dot org
  1 sibling, 0 replies; 9+ messages in thread
From: jackie.rosen at hushmail dot com @ 2014-02-16 18:29 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=4024

Jackie Rosen <jackie.rosen at hushmail dot com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |jackie.rosen at hushmail dot com

--- Comment #8 from Jackie Rosen <jackie.rosen at hushmail dot com> ---
*** Bug 260998 has been marked as a duplicate of this bug. ***
Seen from the domain http://volichat.com
Page where seen: http://volichat.com/adult-chat-rooms
Marked for reference. Resolved as fixed @bugzilla.

-- 
You are receiving this mail because:
You are on the CC list for the bug.



* [Bug localedata/4024] collation in pinyin for zh_CN locale
       [not found] <bug-4024-131@http.sourceware.org/bugzilla/>
  2014-02-16 18:29 ` [Bug localedata/4024] collation in pinyin for zh_CN locale jackie.rosen at hushmail dot com
@ 2014-05-28 19:44 ` schwab at sourceware dot org
  1 sibling, 0 replies; 9+ messages in thread
From: schwab at sourceware dot org @ 2014-05-28 19:44 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=4024

Andreas Schwab <schwab at sourceware dot org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|jackie.rosen at hushmail dot com   |


* [Bug localedata/4024] collation in pinyin for zh_CN locale
  2007-02-11 11:19 [Bug localedata/4024] New: " fundawang at gmail dot com
                   ` (5 preceding siblings ...)
  2007-03-23  6:11 ` fundawang at gmail dot com
@ 2007-04-28  6:53 ` drepper at redhat dot com
  6 siblings, 0 replies; 9+ messages in thread
From: drepper at redhat dot com @ 2007-04-28  6:53 UTC (permalink / raw)
  To: glibc-bugs


------- Additional Comments From drepper at redhat dot com  2007-04-28 07:52 -------
You didn't change the state of the bug.  So don't complain if nobody notices it.

I have checked in a different patch.  Your collation file duplicates a lot of
information from the normal iso14651_t1 file.  That's no good.  The setup I have
now allows sharing the data.

-- 
           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|WAITING                     |RESOLVED
         Resolution|                            |FIXED


http://sourceware.org/bugzilla/show_bug.cgi?id=4024

------- You are receiving this mail because: -------
You are on the CC list for the bug, or are watching someone who is.



* [Bug localedata/4024] collation in pinyin for zh_CN locale
  2007-02-11 11:19 [Bug localedata/4024] New: " fundawang at gmail dot com
                   ` (4 preceding siblings ...)
  2007-02-17 16:46 ` fundawang at gmail dot com
@ 2007-03-23  6:11 ` fundawang at gmail dot com
  2007-04-28  6:53 ` drepper at redhat dot com
  6 siblings, 0 replies; 9+ messages in thread
From: fundawang at gmail dot com @ 2007-03-23  6:11 UTC (permalink / raw)
  To: glibc-bugs


------- Additional Comments From fundawang at gmail dot com  2007-03-23 06:11 -------
How is this bug going?


* [Bug localedata/4024] collation in pinyin for zh_CN locale
  2007-02-11 11:19 [Bug localedata/4024] New: " fundawang at gmail dot com
                   ` (3 preceding siblings ...)
  2007-02-17 15:24 ` ed dot trager at gmail dot com
@ 2007-02-17 16:46 ` fundawang at gmail dot com
  2007-03-23  6:11 ` fundawang at gmail dot com
  2007-04-28  6:53 ` drepper at redhat dot com
  6 siblings, 0 replies; 9+ messages in thread
From: fundawang at gmail dot com @ 2007-02-17 16:46 UTC (permalink / raw)
  To: glibc-bugs


------- Additional Comments From fundawang at gmail dot com  2007-02-17 16:46 -------
> So does this mean you are going to order based on character
> usage frequency within pinyin+tone category?
Correct.

In fact, most Chinese users only care that characters with the same
pronunciation are sorted together; the order within a pinyin+tone group is
not that important.


* [Bug localedata/4024] collation in pinyin for zh_CN locale
  2007-02-11 11:19 [Bug localedata/4024] New: " fundawang at gmail dot com
                   ` (2 preceding siblings ...)
  2007-02-17  7:44 ` fundawang at gmail dot com
@ 2007-02-17 15:24 ` ed dot trager at gmail dot com
  2007-02-17 16:46 ` fundawang at gmail dot com
                   ` (2 subsequent siblings)
  6 siblings, 0 replies; 9+ messages in thread
From: ed dot trager at gmail dot com @ 2007-02-17 15:24 UTC (permalink / raw)
  To: glibc-bugs



------- Additional Comments From ed dot trager at gmail dot com  2007-02-17 15:24 -------
Subject: Re:  collation in pinyin for zh_CN locale

Pinyin collation for zh_CN as the default will be great -- In fact, I
am surprised to learn it isn't done that way already!

I do have a question though:  What is the order of characters nested
within a given pinyin+tone category?  For example, is this going to
follow the standard order of one of the big dictionaries?  Or
something else?

My copy of "The Pinyin Chinese-English Dictionary 汉英词典" (Wu Jingrong
ed., Beijing Foreign Languages Institute 1979) seems to order
characters by number of strokes within pinyin category, i.e. "jin1":
巾今斤金津矜 ... etc.  This is one logical way to do it.

But my copy of 现代汉语词典 (中国社会科学院语言研究所 Commercial Press Beijing 1986)
orders "jin1" completely differently: 津禁襟巾今衿矜 ... etc.  I'm not sure
what the logic here is ...

Another logical way to do it would be to order by how frequently the
character is used.  If I remember correctly from an earlier post, the
Perl script for generating the locale was pulling data from SCIM
tables.  So does this mean you are going to order based on character
usage frequency within pinyin+tone category?
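
To make the stroke-count option concrete, here is a minimal Python sketch of
sorting by pinyin syllable, then tone, then number of strokes, with the code
point as a final tie-breaker.  The table is typed in by hand for illustration
only (it is not taken from SCIM or from the attached collation file), and the
names PINYIN_TABLE, pinyin_stroke_key and sort_by_pinyin_then_strokes are
hypothetical.

```python
# Hand-entered illustration data: character -> (syllable, tone, strokes).
# A real table would be generated from a pinyin source and a stroke-count
# source; nothing here comes from the attachment on this bug.
PINYIN_TABLE = {
    "八": ("ba", 1, 2),
    "巾": ("jin", 1, 3),
    "今": ("jin", 1, 4),
    "斤": ("jin", 1, 4),
    "金": ("jin", 1, 8),
    "津": ("jin", 1, 9),
    "矜": ("jin", 1, 9),
    "近": ("jin", 4, 7),
}


def pinyin_stroke_key(ch, table=PINYIN_TABLE):
    """Build a sort key: syllable, tone, stroke count, then code point."""
    syllable, tone, strokes = table[ch]
    return (syllable, tone, strokes, ord(ch))


def sort_by_pinyin_then_strokes(chars, table=PINYIN_TABLE):
    """Return the characters sorted by pinyin+tone, then by stroke count."""
    return sorted(chars, key=lambda ch: pinyin_stroke_key(ch, table))


if __name__ == "__main__":
    print("".join(sort_by_pinyin_then_strokes("津近金八今巾矜斤")))
```

Ordering by usage frequency instead would only mean replacing the stroke
count in the key with a frequency rank; the pinyin+tone grouping stays the
same either way.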

Best - Ed

On 17 Feb 2007 07:44:43 -0000, fundawang at gmail dot com
<sourceware-bugzilla@sourceware.org> wrote:
>
> ------- Additional Comments From fundawang at gmail dot com  2007-02-17 07:44 -------
> > So, what exactly is the proposal?  Create a new locale zh_CN@pinyin,
> > or use the new collation data for zh_CN?  The former sounds much
> > safer to me.
> There will be several collations for Chinese, such as pronunciation (pinyin)
> and strokes. The most widely used collation is actually pinyin. The iso14651
> collation is of no use for Chinese.
>
> So, the proposal is to replace the current collation for zh_CN (iso14651)
> with pinyin. As for strokes, we'll likely propose zh_CN@strokes in the future.



* [Bug localedata/4024] collation in pinyin for zh_CN locale
  2007-02-11 11:19 [Bug localedata/4024] New: " fundawang at gmail dot com
  2007-02-11 11:27 ` [Bug localedata/4024] " fundawang at gmail dot com
  2007-02-17  7:36 ` drepper at redhat dot com
@ 2007-02-17  7:44 ` fundawang at gmail dot com
  2007-02-17 15:24 ` ed dot trager at gmail dot com
                   ` (3 subsequent siblings)
  6 siblings, 0 replies; 9+ messages in thread
From: fundawang at gmail dot com @ 2007-02-17  7:44 UTC (permalink / raw)
  To: glibc-bugs


------- Additional Comments From fundawang at gmail dot com  2007-02-17 07:44 -------
> So, what exactly is the proposal?  Create a new locale zh_CN@pinyin,
> or use the new collation data for zh_CN?  The former sounds much
> safer to me.
There will be several collations for Chinese, such as pronunciation (pinyin)
and strokes. The most widely used collation is actually pinyin. The iso14651
collation is of no use for Chinese.

So, the proposal is to replace the current collation for zh_CN (iso14651)
with pinyin. As for strokes, we'll likely propose zh_CN@strokes in the future.
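
For context on what replacing the collation would mean for applications:
programs normally do not pick a collation themselves but compare strings
through strcoll/strxfrm under the active LC_COLLATE, so new collation data
for zh_CN would change their sort order without any code change.  A minimal
Python sketch, assuming a zh_CN.UTF-8 locale is installed (it falls back to
code point order if not); sort_in_locale is a hypothetical helper name.

```python
import locale


def sort_in_locale(words, locale_name="zh_CN.UTF-8"):
    """Sort words using the LC_COLLATE rules of locale_name.

    Returns (sorted_words, used_locale).  If the locale is not installed,
    the words are sorted by code point and used_locale is False.
    """
    saved = locale.setlocale(locale.LC_COLLATE)  # query the current setting
    try:
        locale.setlocale(locale.LC_COLLATE, locale_name)
    except locale.Error:
        return sorted(words), False
    try:
        # strxfrm maps each string to a key that compares like strcoll.
        return sorted(words, key=locale.strxfrm), True
    finally:
        locale.setlocale(locale.LC_COLLATE, saved)  # restore previous setting


if __name__ == "__main__":
    result, used = sort_in_locale(["金", "巾", "津", "今"])
    print(result, "(locale collation)" if used else "(code point fallback)")
```

The resulting order depends entirely on the collation data shipped with the
installed locale, so no expected output is shown here.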


* [Bug localedata/4024] collation in pinyin for zh_CN locale
  2007-02-11 11:19 [Bug localedata/4024] New: " fundawang at gmail dot com
  2007-02-11 11:27 ` [Bug localedata/4024] " fundawang at gmail dot com
@ 2007-02-17  7:36 ` drepper at redhat dot com
  2007-02-17  7:44 ` fundawang at gmail dot com
                   ` (4 subsequent siblings)
  6 siblings, 0 replies; 9+ messages in thread
From: drepper at redhat dot com @ 2007-02-17  7:36 UTC (permalink / raw)
  To: glibc-bugs


------- Additional Comments From drepper at redhat dot com  2007-02-17 07:35 -------
So, what exactly is the proposal?  Create a new locale zh_CN@pinyin, or use the
new collation data for zh_CN?  The former sounds much safer to me.

-- 
           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |WAITING



* [Bug localedata/4024] collation in pinyin for zh_CN locale
  2007-02-11 11:19 [Bug localedata/4024] New: " fundawang at gmail dot com
@ 2007-02-11 11:27 ` fundawang at gmail dot com
  2007-02-17  7:36 ` drepper at redhat dot com
                   ` (5 subsequent siblings)
  6 siblings, 0 replies; 9+ messages in thread
From: fundawang at gmail dot com @ 2007-02-11 11:27 UTC (permalink / raw)
  To: glibc-bugs


------- Additional Comments From fundawang at gmail dot com  2007-02-11 11:27 -------
Created an attachment (id=1547)
 --> (http://sourceware.org/bugzilla/attachment.cgi?id=1547&action=view)
zh_CN@pinyin.gb18030 collate

The attachment is the collation data for the zh_CN locale. The method for
generating this data can be found at
http://download.gro.clinux.org/fedora/locale-pinyin-0.1.tar.gz

The pinyin_table.txt inside that package comes from scim (a GPLed IME).
The gen_pinyin.pl script in that package was written by
hellwolf.misty@gmail.com and is also GPLed.


