toebler-ocr

Book Transcription Project

An OCR project using historical French book data to train models and generate transcriptions.

GitHub

1 stars
4 watching
0 forks
Language: HTML
last commit: almost 6 years ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
chreul/ocr_testdata_earlyprintedbooks Provides test data and models for training Optical Character Recognition (OCR) systems on historical printed books. 10
jbaiter/archiscribe A tool for transcribing OCR data from archival documents 17
hamdikahloun/windows_ocr An OCR library allowing developers to embed high-quality character recognition functionality in their products. 18
cpitclaudel/alectryon Tools for processing Coq code and prose in technical documents 236
dannnylo/rtesseract A Ruby library providing an interface to the Tesseract OCR system. 828
jean-baptiste-camps/froc-mss Develops models to transcribe handwritten text from Old French and Old Occitan medieval manuscripts 0
mittagessen/kraken An OCR system optimized for historical and non-Latin scripts 748
tberg12/ocular An OCR system designed to transcribe historical documents with high accuracy, handling various challenges such as font variation and code-switching. 255
rescribe/carolineminuscule-groundtruth An OCR ground truth repository for Caroline Minuscule manuscripts. 11
ub-mannheim/ocr-gt-tools A web-based tool for editing and annotating OCR transcriptions of scanned text 48
benwbrum/fromthepage A wiki-like application for collaborative transcription of handwritten documents from scanned pages. 171
bytedance/piano_transcription Develops software for accurately transcribing piano recordings into MIDI files using machine learning models. 1,658
cneud/ocr-conversion A collection of scripts and stylesheets for converting data between different OCR formats. 71
carreau/jupyter-book A project that creates a book on Jupyter, focusing on its capabilities and applications 19
nytud/hadifogoly-adatbazis An attempt to transcribe Cyrillic text into Hungarian script for a large dataset of WWII prisoner-of-war records 23