ocular

Historical OCR software

An OCR system designed to transcribe historical documents with high accuracy, handling various challenges such as font variation and code-switching.

Ocular is a state-of-the-art historical OCR system.

GitHub

256 stars
32 watching
48 forks
Language: Java
last commit: 6 months ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
chreul/ocr_testdata_earlyprintedbooks Provides test data and models for training Optical Character Recognition (OCR) systems on historical printed books. 10
ocr4all/ocr4all Provides OCR services for historical documents through an intuitive web interface 244
ryanfb/ancientgreekocr-ocr-evaluation-tools A collection of tools and scripts to evaluate the accuracy of Optical Character Recognition (OCR) systems 22
ibm/max-ocr An optical character recognition system deployed as a web service using a trained Tesseract OCR model 47
ncsu-libraries/ocracoke A Rails application that enables the creation of OCR capabilities for indexing text from page images and providing search results in IIIF format. 34
r1me/ttesseractocr4 An Object Pascal binding for the Tesseract OCR engine to perform optical character recognition 145
antoniogarrote/clj-tesseract A Clojure wrapper for the Tesseract OCR software, allowing developers to easily integrate optical character recognition capabilities into their applications. 54
ponteineptique/toebler-ocr An OCR project using historical French book data to train models and generate transcriptions. 1
hamdikahloun/windows_ocr An OCR library allowing developers to embed high-quality character recognition functionality in their products. 18
mittagessen/kraken An OCR system optimized for historical and non-Latin scripts, providing layout analysis, character recognition, and support for various formats. 757
ocropus/hocr-tools Tools for manipulating and analyzing multi-lingual OCR results by representing them in a standard HTML format 373
jzarca01/whoffer A React Native app that uses OCR to claim rewards by deciphering images 0
tesseract-ocr/docs A collection of documents detailing various aspects and improvements to the Tesseract OCR engine 262
ub-mannheim/ocr-gt-tools A web-based tool for editing and annotating OCR transcriptions of scanned text 48
openseg-group/openseg.pytorch Provides a PyTorch implementation of several computer vision tasks including object detection, segmentation and parsing. 1,191