tesseract: An OCR Engine that was developed at HP Labs between 1985 and 19951

The Tesseract OCR engine was one of the top 3 engines in the 1995 UNLV Accuracy test. Since then it has had little work done on it, but it is probably one of the most accurate open source OCR engines available.

... part of T2, get it here

URL: https://code.google.com/p/tesseract-ocr/

Author: Ray Smith <theraysmith [at] users [dot] sourceforge [dot] net>
Maintainer: Rene Rebe <rene [at] t2-project [dot] org>

License: APL
Status: Stable
Version: 5.4.1

Remark: Does cross compile (as setup and patched in T2).

Download: https://github.com/tesseract-ocr/tesseract/ tesseract-5.4.1.tar.gz
Download: https://github.com/tesseract-ocr/tessdata/ tessdata-4.1.0.tar.gz

T2 source: tesseract.cache
T2 source: tesseract.conf
T2 source: tesseract.desc

Build time (on reference hardware): 40% (relative to binutils)2

Installed size (on reference hardware): 20.59 MB, 341 files

Dependencies (build time detected): 00-dirtree binutils coreutils diffutils findutils gawk grep leptonlib libjpeg libpng libtiff linux-header make patch sed sysfiles tar zlib

Installed files (on reference hardware): [show]

1) This page was automatically generated from the T2 package source. Corrections, such as dead links, URL changes or typos need to be performed directly on that source.

2) Compatible with Linux From Scratch's "Standard Build Unit" (SBU).