9951 explained code solutions for 126 technologies


tesseract-ocrHow can I use Tesseract OCR to recognize Russian text?


Tesseract OCR can be used to recognize Russian text. To do so, the Tesseract command line tool needs to be installed and configured to use the rus language.

First, install the Tesseract command line tool:

sudo apt-get install tesseract-ocr

Then, install the rus language:

sudo apt-get install tesseract-ocr-rus

Once installed, run the Tesseract command line tool to recognize Russian text from an image file:

tesseract image.png output -l rus

This command will save the recognized text from the image file image.png to the output.txt file.

Code explanation

  1. sudo apt-get install tesseract-ocr - to install the Tesseract command line tool
  2. sudo apt-get install tesseract-ocr-rus - to install the rus language
  3. tesseract image.png output -l rus - to recognize Russian text from an image file

Helpful links

Edit this code on GitHub