How do I use Tesseract OCR with JavaScript?


Tesseract OCR (Optical Character Recognition) is an open-source engine for reading text from images. It can be used from JavaScript to extract text from image files.

To use Tesseract OCR with JavaScript, install the Tesseract.js library, for example with npm install tesseract.js. It is a JavaScript port of the Tesseract OCR engine that runs in both Node.js and the browser.

Example code

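const Tesseract = require('tesseract.js');
const image = 'path/to/image.png';

// 'eng' selects the English language data; logger receives progress messages
Tesseract.recognize(image, 'eng', {
  logger: function (m) { console.log('progress', m); }
})
  .then(function (result) {
    console.log(result.data.text);
  })
  .catch(function (err) {
    console.error(err);
  });

The code above loads the Tesseract.js library, defines the path to the image and then calls the recognize() function. This form applies to Tesseract.js v2 and later, where the chained progress() method from v1 was replaced by a logger option and the recognized text moved from result.text to result.data.text. The logger callback reports the progress of the recognition process, then() prints the extracted text, and catch() prints any error.

Code explanation

  • const Tesseract = require('tesseract.js');: Loads the Tesseract.js library.
  • const image = 'path/to/image.png';: Defines the path to the image.
  • Tesseract.recognize(image, 'eng', { logger: ... }): Starts recognition with the English language data and registers a callback that receives progress messages.
  • .then(function (result) { console.log(result.data.text); }): Prints the extracted text once recognition finishes.
  • .catch(function (err) { console.error(err); }): Prints the error if the image cannot be loaded or recognized.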
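
The recognition call can also be wrapped in a small reusable helper. The following is a minimal sketch, assuming Tesseract.js v2 or later, where recognize() accepts a language code and an options object with a logger callback, progress messages carry a status string and a progress number between 0 and 1, and the promise resolves to an object whose data.text holds the result. The names formatProgress and extractText are hypothetical and not part of the library; the recognize function is passed in as an argument so the helper does not depend on how the library is loaded.

```javascript
// Hypothetical helper: turn a logger message into a readable line.
// Assumes the message has a `status` string and a `progress` number in [0, 1].
function formatProgress(message) {
  const percent = Math.round((message.progress || 0) * 100);
  return message.status + ': ' + percent + '%';
}

// Hypothetical wrapper: `recognize` is injected by the caller
// (for example Tesseract.recognize); `lang` defaults to English.
function extractText(recognize, image, lang) {
  return recognize(image, lang || 'eng', {
    logger: function (m) { console.log(formatProgress(m)); }
  }).then(function (result) {
    // Assumes the v2+ result shape, with the text under result.data.text
    return result.data.text.trim();
  });
}
```

With the library installed, this could be called as extractText(Tesseract.recognize, 'path/to/image.png') and the returned promise resolves to the trimmed text.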
