pytesseract

Image OCR

An OCR tool that wraps the Google Tesseract Engine to recognize text in images.

A Python wrapper for Google Tesseract
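
As a rough illustration of what "wrapping" Tesseract looks like in practice, here is a minimal sketch of calling pytesseract from Python. It assumes the `pytesseract` and `Pillow` packages and the Tesseract binary are all installed. The helper names `normalize_ocr_text` and `image_to_text` are hypothetical and not part of the library. Only `pytesseract.image_to_string` and `PIL.Image.open` are real library calls.

```python
from pathlib import Path


def normalize_ocr_text(raw: str) -> str:
    """Trim each line of raw OCR output and drop the empty ones.

    Hypothetical helper, not part of pytesseract.
    """
    lines = (line.strip() for line in raw.splitlines())
    return "\n".join(line for line in lines if line)


def image_to_text(image_path: str, lang: str = "eng") -> str:
    """Run Tesseract on one image file and return the cleaned text.

    Hypothetical helper; assumes the Tesseract binary is on PATH and
    that trained data for `lang` is installed.
    """
    if not Path(image_path).is_file():
        raise FileNotFoundError(image_path)

    # Imported lazily so this sketch still loads where the OCR stack is absent.
    import pytesseract
    from PIL import Image

    with Image.open(image_path) as img:
        raw = pytesseract.image_to_string(img, lang=lang)
    return normalize_ocr_text(raw)


# Example call ("scan.png" is a placeholder path):
#     print(image_to_text("scan.png"))
```

The cleanup step is separate from the OCR call because raw Tesseract output often carries blank lines and trailing control characters. Whether to strip them depends on the use case.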

GitHub

6k stars
110 watching
721 forks
Language: Python
Last commit: 25 days ago
Linked from 4 awesome lists


Related projects:

Repository | Description | Stars
tesseract-ocr/tesseract | An OCR engine capable of recognizing text in images from various languages and formats. | 62,363
dannnylo/tesseract-ocr-elixir | A Tesseract OCR wrapper providing Elixir bindings to read and process text from images. | 54
charlesw/tesseract | A .NET wrapper for the Tesseract OCR engine, providing a simple interface for Optical Character Recognition (OCR) tasks. | 2,291
mdelete/node-tesseract-native | An OCR module for Node.js that uses Tesseract and Leptonica to recognize text in images. | 51
dannnylo/tesseract-ocr-crystal | A wrapper around Tesseract OCR that provides an easy-to-use interface for reading characters from images. | 13
zapolnoch/node-tesseract-ocr | An OCR library that wraps the Tesseract API in Node.js to extract text from images. | 305
nguyenq/tess4j | A Java wrapper for the Tesseract OCR API to extract text from images. | 1,612
meh/ruby-tesseract-ocr | A Ruby wrapper around the Tesseract OCR API that provides an easy-to-use interface for optical character recognition tasks. | 629
antoniogarrote/clj-tesseract | A Clojure wrapper for the Tesseract OCR software, allowing developers to integrate optical character recognition into their applications. | 54
gali8/tesseract-ocr-ios | An iOS framework providing Optical Character Recognition (OCR) capabilities using the Tesseract OCR engine. | 4,220
ropensci/tesseract | An R interface to an OCR engine that supports multiple languages and enables text recognition from images. | 245
kscanne/tesseract-gle-uncial | Training data and scripts to improve OCR accuracy for Irish Gaelic fonts. | 3
guitarmind/tesseract-web-service | A RESTful web service that uses Tesseract-OCR for image recognition. | 135
mauvilsa/tesseract-recognize | A tool for text recognition and layout analysis using Tesseract OCR, outputting results in Page XML format. | 44
sirfz/tesserocr | A Python OCR API wrapper that releases the GIL during recognition, enabling concurrent execution with Python's threading module. | 2,016