tesseract-recognize

OCR tool

A tool for performing text recognition and layout analysis using Tesseract OCR, outputting results in Page XML format.

Tool that does layout analysis and/or text recognition using tesseract and outputs the result in Page XML format

GitHub

44 stars
5 watching
7 forks
Language: C++
last commit: 8 months ago
Linked from 1 awesome list

clidocker-imagedocument-recognitionocroptical-character-recognitionpagexmltesseracttext-detection

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
meh/ruby-tesseract-ocr A Ruby wrapper around the Tesseract OCR API to provide an easy-to-use interface for optical character recognition tasks 629
r1me/ttesseractocr4 An Object Pascal binding for the Tesseract OCR engine to perform optical character recognition 145
tesseract-ocr/docs A collection of documents detailing various aspects and improvements to the Tesseract OCR engine 262
zapolnoch/node-tesseract-ocr An OCR library that wraps the Tesseract API in Node.js to extract text from images 308
ropensci/tesseract Provides R interface to a powerful OCR engine supporting multiple languages and enabling text recognition from images. 245
antoniogarrote/clj-tesseract A Clojure wrapper for the Tesseract OCR software, allowing developers to easily integrate optical character recognition capabilities into their applications. 54
mdelete/node-tesseract-native An OCR module for Node.js using Tesseract and Leptonica to recognize text from images 51
alimranahmed/laraocr An OCR package for Laravel that reads text from images using Tesseract 152
jeenyuhs/vesseract A wrapper around Tesseract-OCR to simplify OCR functionality in V programming language 17
sirfz/tesserocr An OCR API wrapper that enables concurrent execution using Python's threading module and releases the GIL. 2,026
dannnylo/tesseract-ocr-crystal A wrapper around Tesseract OCR that provides an easy-to-use interface for reading characters from images. 13
ibm/max-ocr An optical character recognition system deployed as a web service using a trained Tesseract OCR model 47
tesseract-ocr/tesseract An OCR engine capable of recognizing text in images from various languages and formats. 63,142
nguyenq/tess4j A Java wrapper for using the Tesseract OCR API to extract text from images 1,619
dannnylo/tesseract-ocr-elixir A Tesseract OCR wrapper providing Elixir bindings to read and process text from images 55