public inbox for glibc-bugs@sourceware.org
help / color / mirror / Atom feed
* [Bug localedata/23048] iconv: add more macintosh tables and aliases
       [not found] <bug-23048-131@http.sourceware.org/bugzilla/>
@ 2023-03-21 21:04 ` jep at webfactory dot de
  2023-03-22  9:13 ` jep at webfactory dot de
                   ` (3 subsequent siblings)
  4 siblings, 0 replies; 5+ messages in thread
From: jep at webfactory dot de @ 2023-03-21 21:04 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=23048

Jascha Eliano Paetzold <jep at webfactory dot de> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |jep at webfactory dot de

--- Comment #5 from Jascha Eliano Paetzold <jep at webfactory dot de> ---
https://www.gnu.org/software/libiconv supports all these encodings according to
its website. Maybe the implementation over there could be used as a reference?

Apart from that, Wikipedia lists the following secondary/primary sources:

mac-roman:
https://developer.apple.com/library/archive/documentation/mac/pdf/Text.pdf#page=89
mac-greek: https://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/GREEK.TXT
mac-centraleurope:
https://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/CENTEURO.TXT
mac-iceland: https://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/ICELAND.TXT
mac-turkish: https://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/TURKISH.TXT

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 5+ messages in thread

* [Bug localedata/23048] iconv: add more macintosh tables and aliases
       [not found] <bug-23048-131@http.sourceware.org/bugzilla/>
  2023-03-21 21:04 ` [Bug localedata/23048] iconv: add more macintosh tables and aliases jep at webfactory dot de
@ 2023-03-22  9:13 ` jep at webfactory dot de
  2023-04-03 14:18 ` fweimer at redhat dot com
                   ` (2 subsequent siblings)
  4 siblings, 0 replies; 5+ messages in thread
From: jep at webfactory dot de @ 2023-03-22  9:13 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=23048

--- Comment #6 from Jascha Eliano Paetzold <jep at webfactory dot de> ---
(In reply to Florian Weimer from comment #3)

Would you trust me if I verified these tables work on an Apple machine?

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 5+ messages in thread

* [Bug localedata/23048] iconv: add more macintosh tables and aliases
       [not found] <bug-23048-131@http.sourceware.org/bugzilla/>
  2023-03-21 21:04 ` [Bug localedata/23048] iconv: add more macintosh tables and aliases jep at webfactory dot de
  2023-03-22  9:13 ` jep at webfactory dot de
@ 2023-04-03 14:18 ` fweimer at redhat dot com
  2023-05-23 11:07 ` jep at webfactory dot de
  2023-05-23 11:48 ` fweimer at redhat dot com
  4 siblings, 0 replies; 5+ messages in thread
From: fweimer at redhat dot com @ 2023-04-03 14:18 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=23048

--- Comment #7 from Florian Weimer <fweimer at redhat dot com> ---
(In reply to Jascha Eliano Paetzold from comment #6)
> (In reply to Florian Weimer from comment #3)
> 
> Would you trust me if I verified these tables work on an Apple machine?

I think we could use iconv (the function) on macOS to enumerate the 8-bit bit
range, and then double-check the 20.1-bit Unicode space for any unexpected
mappings in the other direction. That would be enough verification for me.

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 5+ messages in thread

* [Bug localedata/23048] iconv: add more macintosh tables and aliases
       [not found] <bug-23048-131@http.sourceware.org/bugzilla/>
                   ` (2 preceding siblings ...)
  2023-04-03 14:18 ` fweimer at redhat dot com
@ 2023-05-23 11:07 ` jep at webfactory dot de
  2023-05-23 11:48 ` fweimer at redhat dot com
  4 siblings, 0 replies; 5+ messages in thread
From: jep at webfactory dot de @ 2023-05-23 11:07 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=23048

--- Comment #8 from Jascha Eliano Paetzold <jep at webfactory dot de> ---
(In reply to Florian Weimer from comment #7)
> (In reply to Jascha Eliano Paetzold from comment #6)
> > (In reply to Florian Weimer from comment #3)
> > 
> > Would you trust me if I verified these tables work on an Apple machine?
> 
> I think we could use iconv (the function) on macOS to enumerate the 8-bit
> bit range, and then double-check the 20.1-bit Unicode space for any
> unexpected mappings in the other direction. That would be enough
> verification for me.

Thanks for your offer to collaborate on that topic!

I have little to no experience with encodings, but I assume that it would be
sufficient for me to generate a text file spanning all possible characters in
the 8 bit range and pass it to the iconv command in the MacOS shell (setting it
to convert from unicode to mac-greek etc.) and then post the output here?

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 5+ messages in thread

* [Bug localedata/23048] iconv: add more macintosh tables and aliases
       [not found] <bug-23048-131@http.sourceware.org/bugzilla/>
                   ` (3 preceding siblings ...)
  2023-05-23 11:07 ` jep at webfactory dot de
@ 2023-05-23 11:48 ` fweimer at redhat dot com
  4 siblings, 0 replies; 5+ messages in thread
From: fweimer at redhat dot com @ 2023-05-23 11:48 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=23048

--- Comment #9 from Florian Weimer <fweimer at redhat dot com> ---
(In reply to Jascha Eliano Paetzold from comment #8)
> I have little to no experience with encodings, but I assume that it would be
> sufficient for me to generate a text file spanning all possible characters
> in the 8 bit range and pass it to the iconv command in the MacOS shell
> (setting it to convert from unicode to mac-greek etc.) and then post the
> output here?

For 8-bit input lie that, you'd have to convert *from* mac-greek.

For conversion to mac-greek, you'd have to enumerate all Unicode codepoints (or
maybe just the BMP), and somehow skip over unconvertable characters. That is
probably best done programmatically because iconv (the shell command) may not
provide enough information for skipped/non-convertable Unicode codepoints.

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2023-05-23 11:48 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
     [not found] <bug-23048-131@http.sourceware.org/bugzilla/>
2023-03-21 21:04 ` [Bug localedata/23048] iconv: add more macintosh tables and aliases jep at webfactory dot de
2023-03-22  9:13 ` jep at webfactory dot de
2023-04-03 14:18 ` fweimer at redhat dot com
2023-05-23 11:07 ` jep at webfactory dot de
2023-05-23 11:48 ` fweimer at redhat dot com

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).