To: cygwin-announce@cygwin.com
Cc: tesseract-ocr@googlegroups.com
From: Marco Atzeri
Subject: Updated: tesseract-ocr-3.04.00-2
Message-ID: <55BDD87B.2020502@gmail.com>
Date: Sun, 02 Aug 2015 12:23:00 -0000

Version 3.04.00-2 of packages

  libtesseract-ocr_3
  tesseract-ocr
  tesseract-ocr-devel
  tesseract-training-util (NEW)

and version 3.04-1 of

  tesseract-ocr-eng
  tesseract-ocr-deu
  tesseract-ocr-fra
  tesseract-ocr-ita
  tesseract-ocr-nld
  tesseract-ocr-por
  tesseract-ocr-spa
  tesseract-ocr-vie
  tesseract-training-core (NEW)
  tesseract-training-eng (NEW)
  tesseract-training-deu (NEW)
  tesseract-training-fra (NEW)
  tesseract-training-ita (NEW)
  tesseract-training-nld (NEW)
  tesseract-training-por (NEW)
  tesseract-training-spa (NEW)
  tesseract-training-vie (NEW)

are available in the Cygwin distribution.

Other language-specific data are available upstream
  https://github.com/tesseract-ocr/tessdata
while training data for building new language data are in
  https://github.com/tesseract-ocr/langdata

CYGWIN CHANGES
Rebuilt to include the training tools and base data
to create or update language data.
Training tools, not needed by normal users, are in
  tesseract-training-util
and data in
  tesseract-training-core
  tesseract-training-{lang}

CHANGES
None. Last upstream release.
  https://github.com/tesseract-ocr/tesseract/wiki

DESCRIPTION
Tesseract is probably the most accurate open source OCR engine
available. Combined with the Leptonica Image Processing Library
it can read a wide variety of image formats and convert them to
text in over 60 languages.
It was one of the top 3 engines in the 1995 UNLV Accuracy test.
Improved extensively by Google.
It is released under the Apache License 2.0.

HOMEPAGE
https://github.com/tesseract-ocr/

Marco Atzeri

If you have questions or comments, please send them to the
cygwin mailing list at: cygwin (at) cygwin (dot) com.