From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 61587 invoked by alias); 22 Sep 2015 11:23:45 -0000 Mailing-List: contact cygwin-help@cygwin.com; run by ezmlm Precedence: bulk List-Id: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: cygwin-owner@cygwin.com Mail-Followup-To: cygwin@cygwin.com Received: (qmail 61575 invoked by uid 89); 22 Sep 2015 11:23:45 -0000 Authentication-Results: sourceware.org; auth=none X-Virus-Found: No X-Spam-SWARE-Status: No, score=4.7 required=5.0 tests=AWL,BAYES_50,CYGWIN_OWNER_BODY,KAM_LAZY_DOMAIN_SECURITY,RCVD_IN_DNSWL_LOW autolearn=no version=3.3.2 X-HELO: mx4.adesso.de Received: from mx4.adesso.de (HELO mx4.adesso.de) (85.22.63.1) by sourceware.org (qpsmtpd/0.93/v0.84-503-g423c35a) with (AES256-SHA encrypted) ESMTPS; Tue, 22 Sep 2015 11:23:43 +0000 Received: from ex2010-cas02.adesso.local (gw-do-ext.adesso.de [172.20.20.1]) by mx4.adesso.de (8.14.7/8.14.7) with ESMTP id t8MBNa4C000682 (version=TLSv1/SSLv3 cipher=AES128-SHA bits=128 verify=FAIL); Tue, 22 Sep 2015 13:23:36 +0200 Received: from EX2010-DB01.adesso.local ([10.1.1.215]) by ex2010-cas02.adesso.local ([fe80::a1eb:7970:ce3d:a3cf%11]) with mapi id 14.03.0248.002; Tue, 22 Sep 2015 13:23:36 +0200 From: "Schmitz, Marco" To: Marco Atzeri , "cygwin@cygwin.com" Subject: AW: Tesseract 3.04 - Cygwin64 - Windows 8.1 - Can't open makebox Date: Tue, 22 Sep 2015 11:23:00 -0000 Message-ID: References: <560010D0.4010903@gmail.com> In-Reply-To: <560010D0.4010903@gmail.com> Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 X-IsSubscribed: yes X-SW-Source: 2015-09/txt/msg00348.txt.bz2 Hi Marco, without setting TESSDATA_PREFIX (neither Windows environment variables nor = .bash_profile) I get: $ tesseract --list-langs Error opening data file C:\DEV\tesseract\Tesseract-OCR\tessdata/eng.trained= data Please make sure the TESSDATA_PREFIX environment variable is set to the par= ent directory of your "tessdata" directory. Failed loading language 'eng' Tesseract couldn't load any languages! Could not initialize tesseract. This is my first problem, which I solved defining TESSDATA_PREFIX (in Windo= ws environment). Now I get: $ tesseract --list-langs List of available languages (13): arbeitsunfaehigkeit deu deu_frak eng fra ita ita_old nld osd por spa spa_old vie Then I try this: $ tesseract arbeitsunfaehigkeit.hausarzt.exp0.jpg arbeitsunfaehigkeit batch= .nochop makebox Tesseract Open Source OCR Engine v3.04.00 with Leptonica Okay, but originally I wrote this issue because I tried to call it from a s= hell script. So, this is my box.sh: #!/usr/bin/env bash tesseract arbeitsunfaehigkeit.hausarzt.exp0.jpg arbeitsunfaehigkeit batch.n= ochop makebox and calling it brings up the original error: $ ./box.sh Tesseract Open Source OCR Engine v3.04.00 with Leptonica read_params_file: Can't open makebox Best regards, Marco -----Urspr=FCngliche Nachricht----- Von: cygwin-owner@cygwin.com [mailto:cygwin-owner@cygwin.com] Im Auftrag vo= n Marco Atzeri Gesendet: Montag, 21. September 2015 16:15 An: cygwin@cygwin.com Betreff: Re: Tesseract 3.04 - Cygwin64 - Windows 8.1 - Can't open makebox On 21/09/2015 11:03, Schmitz, Marco wrote: > I am using Windows 8.1 and Cygwin64 in order to run Tesseract 3.04. > > Running the following command: > > tesseract arbeitsunfaehigkeit.hausarzt.exp0.jpg arbeitsunfaehigkeit = batch.nochop makebox > > results in the following output: > > Tesseract Open Source OCR Engine v3.04.00 with Leptonica > read_params_file: Can't open makebox > > And this is after I fixed the output: > > Tesseract Open Source OCR Engine v3.04.00 with Leptonica > Error opening data file C:\DEV\tesseract\Tesseract-OCR\tessdata/eng.= traineddata Are you defining TESSDATA_PREFIX ? Why ? > Please make sure the TESSDATA_PREFIX environment variable is set to = the parent directory of your "tessdata" directory. > Failed loading language 'eng' > Tesseract couldn't load any languages! > Could not initialize tesseract. > > Using the following line in .bash_profile: > > export TESSDATA_PREFIX=3D"/cygdrive/c/DEV/cygwin64/usr/share/tessdat= a/" The default should be TESSDATA_PREFIX=3D"/usr/share/tessdata/" Without defining TESSDATA_PREFIX, I have $ tesseract.exe --list-langs List of available languages (4): deu deu_frak eng osd and the language files are in : $ ls /usr/share/tessdata/ configs/ eng.cube.fold eng.cube.size=20 osd.traineddata deu.traineddata eng.cube.lm eng.cube.word-freq pdf.ttf deu_frak.traineddata eng.cube.nn eng.tesseract_cube.nn tessconfigs/ eng.cube.bigrams eng.cube.params eng.traineddata training/ Regards Marco -- Problem reports: http://cygwin.com/problems.html FAQ: http://cygwin.com/faq/ Documentation: http://cygwin.com/docs.html Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple -- Problem reports: http://cygwin.com/problems.html FAQ: http://cygwin.com/faq/ Documentation: http://cygwin.com/docs.html Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple