[ACCEPTED]-Installing Tesseract-OCR on CentOS 6-tesseract
I used these instructions which worked correctly 4 in Centos
Install Tesseract OCR libs from sources in Centos
Download Leptonica and Teseract 3 sources:
$ wget http://www.leptonica.org/source/leptonica-1.69.tar.gz
$ wget https://tesseract-ocr.googlecode.com/files/tesseract-ocr-3.02.02.tar.gz
Configure, compile, install libs:
$ tar xzvf leptonica-1.69.tar.gz
$ cd leptonica-1.69
$ ./configure
$ make
$ sudo make install
$ tar xzf tesseract-ocr-3.02.02.tar.gz
$ cd tesseract-3.01
$ ./autogen.sh
$ ./configure
$ make
$ sudo make install
$ sudo ldconfig
Download 2 languages (english) and copy to tessdata 1 folder:
$ wget http://tesseract-ocr.googlecode.com/files/tesseract-ocr-3.02.eng.tar.gz
$ tar xzf tesseract-ocr-3.02.eng.tar.gz
$ sudo cp tesseract-ocr/tessdata/* /usr/local/share/tessdata
and enjoy it ;)
I recommend to try installing from rpm here: http://pkgs.org/download/tesseract There 3 are also several dependencies: libpng-devel, libjpeg-devel, libtiff-devel, zlib 2 and leptonica. Last 2 can also be found 1 on RPM site
I have written a bash script to install 5 Tesseract 3.05 on Centos 7. This fetches 4 and installs all dependencies, and also 3 installs language files for English, Hindi, Bengali 2 and Thai.
Code available on GitHub
https://github.com/EisenVault/install-tesseract-redhat-centos
Hope this 1 helps.
This worked for me :
/usr/bin/yum --enablerepo epel-testing install tesseract.x86_64 tesseract-langpack-fra.noarch
tesseract is not in 2 the epel repository but in the epel-testing 1 repo witch is not activated by default.
Install Tesseract OCR libs from sources (UPDATED as on 14th July 2018)
Download Leptonica and Teseract sources:
$ wget 8 http://www.leptonica.com/source/leptonica-1.76.0.tar.gz
$ wget https://sourceforge.net/projects/tesseract-ocr-alt/files/tesseract-ocr-3.02.02.tar.gz
Configure, compile, install Leptonica:
$ tar xzvf leptonica-1.76.0.tar.gz
$ cd leptonica-1.76.0
$ ./configure & make & sudo make install
Configure, compile, install 7 Tesseract:
$ tar xzf tesseract-ocr-3.02.02.tar.gz
$ cd tesseract-ocr
$ ./autogen.sh & ./configure & make & sudo make install & sudo ldconfig
Download language file:
I am downloading 6 english language file(eng.traineddata) here. You 5 can see complete list of language files 4 here and download as per your need. https://github.com/tesseract-ocr/tesseract/wiki/Data-Files#data-files-for-version-302
Download 3 languages (english) and copy to tessdata 2 folder:
$ wget https://sourceforge.net/projects/tesseract-ocr-alt/files/tesseract-ocr-3.02.eng.tar.gz
$ tar xzf tesseract-ocr-3.02.eng.tar.gz
$ sudo cp tesseract-ocr/tessdata/* /usr/local/share/tessdata
Now your Tesseract OCR is installed 1 and ready to use! Example:
$tesseract /path/to/input/test.jpg /path/to/output/abc.txt -l eng
Enjoy!!!
yum install --nogpgcheck tesseract
after 2 installation to test enter the following 1 command: tesseract --version
More Related questions
We use cookies to improve the performance of the site. By staying on our site, you agree to the terms of use of cookies.