Subject: Re: bug in docbook-tools text backend using w3m
From: Ondřej Vašík
To: Christian Bünnig
Cc: docbook-tools-discuss@sources.redhat.com
In-Reply-To: <1195223148.10704.22.camel@ume>
References: <1195223148.10704.22.camel@ume>
Date: Thu, 22 Nov 2007 15:06:00 -0000
Message-Id: <1195743882.3846.6.camel@dhcp-lab-219.englab.brq.redhat.com>

Hi,

Actually, I would prefer to modify the ARGS passed to w3m, as suggested in
the man pages and as done in the xmlto docbook txt formatter. (This is done
in the attached patch; I'll commit that one to Fedora.)

About Eric Bischoff's current address: after a bit of Google searching, it
seems that he is now using ebischoff@nerim.net, but nothing is guaranteed.

Greetings,
Ondrej Vasik

Christian Bünnig wrote:
> Hey,
>
> I've experienced a bug in the text backend of the docbook tools. The bug
> is about converting from HTML to TEXT with w3m. The backend creates a
> temporary HTML file to make a TEXT file from. However, the temporary
> HTML has no '.html' suffix and in that case w3m does not convert it to
> plain text - the resulting .txt is still HTML.
>
> Below is a workaround. I just appended '.html' to the variable HTML
> (line 22). Now it works.
>
> Btw .. the e-mail address seems to be not valid
> anymore.
>
> Regards,
>
> Christian

[Attachment: docbook-utils-w3mtxtconvert.patch]

diff -urNp original/txt new/txt
--- original/backends/txt	2007-11-05 18:44:52.000000000 +0100
+++ new/backends/txt	2007-11-22 15:21:36.000000000 +0100
@@ -13,7 +13,7 @@ then
 elif [ -x /usr/bin/w3m ]
 then
   CONVERT=/usr/bin/w3m
-  ARGS="-dump"
+  ARGS="-T text/html -dump"
 else
   echo >&2 "No way to convert HTML to text found."
   exit 1
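
For readers who want to try the change outside the backend script, the w3m branch touched by the attached patch (ARGS changed from "-dump" to "-T text/html -dump") can be sketched roughly as follows. This is only an illustration under assumptions, not the actual backends/txt script: the function name select_converter and the BIN_DIR variable are hypothetical and were added to make the snippet testable, the earlier branches of the original if chain are not visible in the patch and are omitted, and the function returns 1 where the original script calls exit 1.

```shell
#!/bin/sh
# Sketch of the w3m branch from the patched backend (assumptions noted below).
# BIN_DIR is a hypothetical override; the patch itself hard-codes /usr/bin.
BIN_DIR=${BIN_DIR:-/usr/bin}

# select_converter is a hypothetical wrapper around the patched hunk.
# On success it sets CONVERT and ARGS, as the backend does.
select_converter() {
  if [ -x "$BIN_DIR/w3m" ]
  then
    CONVERT=$BIN_DIR/w3m
    # -T text/html states the content type explicitly, so the conversion
    # should no longer depend on the temporary file having an '.html' suffix.
    ARGS="-T text/html -dump"
  else
    echo >&2 "No way to convert HTML to text found."
    return 1
  fi
}

# Possible usage (HTML and TXT are hypothetical file name variables):
#   select_converter && "$CONVERT" $ARGS "$HTML" > "$TXT"
```

Compared with appending '.html' to the temporary file name, this keeps the file naming untouched and only changes how w3m is invoked.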