From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 16626 invoked by alias); 4 May 2010 09:29:02 -0000 Received: (qmail 16525 invoked by uid 48); 4 May 2010 09:28:47 -0000 Date: Tue, 04 May 2010 09:29:00 -0000 Message-ID: <20100504092847.16524.qmail@sourceware.org> From: "rq at akl dot lt" To: glibc-bugs@sources.redhat.com In-Reply-To: <20100423142455.11532.rq@akl.lt> References: <20100423142455.11532.rq@akl.lt> Reply-To: sourceware-bugzilla@sourceware.org Subject: [Bug localedata/11532] Support old DOS Lithuanian character sets in iconv X-Bugzilla-Reason: CC Mailing-List: contact glibc-bugs-help@sourceware.org; run by ezmlm Precedence: bulk List-Id: List-Subscribe: List-Post: List-Help: , Sender: glibc-bugs-owner@sourceware.org X-SW-Source: 2010-05/txt/msg00020.txt.bz2 ------- Additional Comments From rq at akl dot lt 2010-05-04 09:28 ------- I've attached five files with mapping tables for each codepage. Their format is: [octal code]: [UTF-8 character] Lower 127 positions (0000-0177) match ASCII in all cases, so only the positions starting 0200 matter. It seems like these charsets are (or maybe were) supported by ICU (see [1]). The page also has some further descriptions that could be used when forming alias names for cp77x charsets: CP770 Lithuanian Standard RST 1095-89 CP771 KBL (Lithuanian and Russian characters) CP772 Lithuanian Standard LST 1284:1993 CP773 Lithuanian (Mix of 771 and 775) CP774 Lithuanian Standard 1283:1993 Unfortunately, I couldn't find source files of ICU mappings of these character sets at [2], so I can't attach them. Instead, I used a small program found at [3], developed a few years ago specifically to act as a converter among different character sets used in Lithuania (note: I changed one symbol in CP770.txt to match with the actual standard). If it's possible to find ICU mappings, I think most likely they should be used as a basis for conversion. Otherwise, the files attached should be fine. [1] http://publib.boulder.ibm.com/infocenter/tivihelp/v24r1/index.jsp?topic=/com.ibm.itcama.doc_6.2.3/itcam_oraclerac63200.htm [2] http://source.icu-project.org/repos/icu/data/trunk/charset/data/ [3] https://www3.mruni.lt/~rims/kodav/#Diegimas -- What |Removed |Added ---------------------------------------------------------------------------- Status|WAITING |NEW http://sourceware.org/bugzilla/show_bug.cgi?id=11532 ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.