How do I use the Tesseract OCR API?
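Tesseract is an open-source OCR engine that recognizes text in images. To use it from Python, you need to install the Tesseract engine itself on your computer, along with the pytesseract wrapper and the Pillow imaging library (for example with pip install pytesseract pillow).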
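Before running any OCR code, it can help to confirm that the Tesseract executable is actually reachable. The snippet below is a minimal sketch that only uses the Python standard library; the helper name find_tesseract is hypothetical, and it assumes the executable is called tesseract and is expected on your PATH.

```python
import shutil


def find_tesseract():
    """Return the full path of the tesseract executable, or None if it is not on PATH.

    `find_tesseract` is an illustrative helper, not part of pytesseract.
    """
    return shutil.which("tesseract")


if __name__ == "__main__":
    location = find_tesseract()
    if location is None:
        print("tesseract was not found on PATH; install it or add it to PATH")
    else:
        print(f"tesseract found at {location}")
```

If the executable lives somewhere that is not on PATH, pytesseract also lets you point to it directly by setting pytesseract.pytesseract.tesseract_cmd to the full path.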
Once the Tesseract OCR API is installed, you can use the following code example to recognize text from an image:
import pytesseract
from PIL import Image
# Read image from which text needs to be extracted
img = Image.open('sample.png')
# Run tesseract OCR on image
text = pytesseract.image_to_string(img)
# Print recognized text
print(text)
The code example above will print out the text that is recognized from the image.
Code explanation
- import pytesseract: This imports the Tesseract OCR API library.
- from PIL import Image: This imports the Python Imaging Library (PIL), which is used to read images.
- img = Image.open('sample.png'): This reads an image file called 'sample.png'.
- text = pytesseract.image_to_string(img): This uses the Tesseract OCR API to recognize text from the image.
- print(text): This prints out the text that is recognized from the image.
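The steps explained above can be wrapped in a small reusable function. This is a sketch under stated assumptions, not a definitive implementation: the function name ocr_image and its parameters lang and psm are made up for illustration, while image_to_string, its lang and config arguments, and TesseractNotFoundError are provided by pytesseract. It assumes the English language data ('eng') is installed.

```python
from pathlib import Path


def ocr_image(path, lang="eng", psm=3):
    """Return the text Tesseract recognizes in the image at `path`.

    `ocr_image`, `lang` and `psm` are illustrative names, not part of pytesseract.
    """
    image_path = Path(path)
    if not image_path.is_file():
        raise FileNotFoundError(f"No such image: {image_path}")

    # Imported here so the path check above works even before
    # the OCR packages are installed.
    import pytesseract
    from PIL import Image

    try:
        with Image.open(image_path) as img:
            # --psm selects Tesseract's page segmentation mode
            # (3 is the fully automatic default).
            text = pytesseract.image_to_string(
                img, lang=lang, config=f"--psm {psm}"
            )
    except pytesseract.TesseractNotFoundError as exc:
        # Raised by pytesseract when the tesseract executable cannot be run.
        raise RuntimeError(
            "The tesseract executable was not found; install Tesseract "
            "or set pytesseract.pytesseract.tesseract_cmd"
        ) from exc

    # Drop leading and trailing whitespace from the recognized text.
    return text.strip()
```

Calling print(ocr_image('sample.png')) would then behave like the original example, with a clearer error if the image file or the Tesseract executable is missing.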