From: "Raif S. Naffah" <raif@fl.net.au>
To: Mark Wielaard <mark@klomp.org>
Cc: Mauve <mauve-discuss@sources.redhat.com>
Subject: Re: new test cases (long)
Date: Fri, 14 Feb 2003 23:58:00 -0000 [thread overview]
Message-ID: <200302151101.21183.raif@fl.net.au> (raw)
In-Reply-To: <1045223836.30202.326.camel@elsschot>
-----BEGIN PGP SIGNED MESSAGE-----
Hash: RIPEMD160
hello Mark,
On Friday 14 February 2003 22:57, Mark Wielaard wrote:
> Hi Raif,
>
> On Sat, 2003-02-08 at 17:17, Raif S. Naffah wrote:
> > the tests are to ensure that the mandated (as per public Javadoc
> > 1.3.1 and 1.4.1) minimal character encodings are supported by the
> > bytecode interpreter.
> > [...]
> > + * gnu/testlet/java/lang/String/getBytes14: new test
>
> Here you test for "ISO8859_15". I looked here:
> http://java.sun.com/j2se/1.4.1/docs/api/java/nio/charset/Charset.html
> but couldn't see where it said this is a required character set.
yes it is not listed there. but i refer you to
<.../j2sdk1.4.1/docs/guide/intl/encoding.doc.html> page of the public
documentation of sun's jdk-1.4.1; 2nd paragraph:
"Sun's Java 2 Software Development Kit, Standard Edition, v. 1.4.1 for
all platforms (SolarisTM operating environment, Linux, and Microsoft
Windows) and the Java 2 Runtime Environment, Standard Edition, v. 1.4.1
for Solaris and Linux support all encodings shown on this page..."
and further down the same page, a table giving the "Basic Encoding Set
(contained in lib/rt.jar) - Supported by java.nio, java.io and
java.lang APIs." in the "Canonical Name for java.io and java.lang
API," column, next to ISO-8859-15 row entry, there is a reference to
"extended encoding set." i took this to mean the value of the
canonical name to be taken from the second set; ie. the extended
encoding set."
there are two possible deductions from this page:
a. "ISO-8859-15" is a MUST encoding in java.nio, as well as in java.io
and java.lang, but in the last two the canonical name is as stated in
the "extended set" i.e. "ISO8859_15" (ISO 8859-15, Latin alphabet No. 9
(and hence the supporting classes are in charsets.jar rather than in
rt.jar).
b. "ISO-8859-15" is only a MUST encoding in java.nio, but not in java.io
nor java.lang.
i adopted the first.
there is of course the 3rd possibility of the writer(s) of these
documentation pages being in contradiction.
the code itself (for the sun's jdk 1.4.1_01) does support ISO-8859-15,
which can be thought of as the lithmus test.
> Is it really required or just nice to have since the Sun
> implementation supports it? (Which might still be a good reason to
> add them to Mauve, but then I would like to label them explicitly as
> such.)
my interpretation of it was that is is a MUST.
> Also you seem to test (in getBytes13) for the "historical names" for
> which I couldn't find a definition.
the relevant javadoc page in sun'd jdk 1.3.1_06
<.../jdk1.3.1/docs/guide/intl/encoding.doc.html> lists the required
encodings:
"...Sun's Java 2 Runtime Environment, Standard Edition, v. 1.3.1 for
Windows comes in two different versions: US-only and international. The
US-only version only supports the encodings shown in the first table.
The international version (which includes the lib\i18n.jar file)
supports all encodings shown on this page."
it then proceeds to list the "Basic Encoding Set" (contained in rt.jar)
where those names are defined.
the only difference is the Latin Alphabet #9.
>... Do you know where they are
> specified? InputStreamReader and OutputStreamWriter getEncoding() are
> supposed to return them but they don't document what they actually
> look like.
the references sun cites are:
* The Unicode standard
<http://www.unicode.org/unicode/standard/standard.html>, and
* The Unicode FAQ <http://www.unicode.org/unicode/faq>.
cheers;
rsn
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.0.7 (GNU/Linux)
Comment: Que du magnifique
iD8DBQE+TYNP+e1AKnsTRiERA/LLAKCrscVg83sy882JsHImp/ybGSipCgCgquAV
4VLd68TlTPVpPV5w296qSCc=
=X4EK
-----END PGP SIGNATURE-----
next prev parent reply other threads:[~2003-02-14 23:58 UTC|newest]
Thread overview: 6+ messages / expand[flat|nested] mbox.gz Atom feed top
2003-02-08 16:15 Raif S. Naffah
2003-02-14 11:57 ` Mark Wielaard
2003-02-14 23:58 ` Raif S. Naffah [this message]
2003-02-16 11:54 ` Mark Wielaard
2003-02-16 16:44 ` Raif S. Naffah
2003-02-16 17:21 ` Mark Wielaard
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=200302151101.21183.raif@fl.net.au \
--to=raif@fl.net.au \
--cc=mark@klomp.org \
--cc=mauve-discuss@sources.redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).