toebler-ocr

Book Transcription Project

An OCR project using historical French book data to train models and generate transcriptions.

1 star

4 watching

0 forks

Language: HTML

Last commit: over 7 years ago

Linked from 1 awesome list

Backlinks from these awesome lists:

kba/awesome-ocr

Related projects:

Repository	Description	Stars
chreul/ocr_testdata_earlyprintedbooks	Provides test data and models for training Optical Character Recognition (OCR) systems on historical printed books.	10
jbaiter/archiscribe	A tool for transcribing OCR data from archival documents.	17
hamdikahloun/windows_ocr	An OCR library allowing developers to embed high-quality character recognition functionality in their products.	18
cpitclaudel/alectryon	A tool for processing Coq and Lean 4 code embedded in text documents.	237
dannnylo/rtesseract	A Ruby library providing an interface to the Tesseract OCR system.	838
jean-baptiste-camps/froc-mss	Develops models to transcribe handwritten text from Old French and Old Occitan medieval manuscripts.	0
mittagessen/kraken	An OCR system optimized for historical and non-Latin scripts, providing layout analysis, character recognition, and support for various formats.	757
tberg12/ocular	An OCR system designed to transcribe historical documents with high accuracy, handling various challenges such as font variation and code-switching.	256
rescribe/carolineminuscule-groundtruth	An OCR ground truth repository for Caroline Minuscule manuscripts.	11
ub-mannheim/ocr-gt-tools	A web-based tool for editing and annotating OCR transcriptions of scanned text.	48
benwbrum/fromthepage	A wiki-like application for collaborative transcription of handwritten documents from scanned pages.	171
bytedance/piano_transcription	Develops software for accurately transcribing piano recordings into MIDI files using machine learning models.	1,676
cneud/ocr-conversion	A collection of scripts and stylesheets for converting data between different OCR formats.	72
carreau/jupyter-book	A project that creates a book on Jupyter, focusing on its capabilities and applications.	19
nytud/hadifogoly-adatbazis	An attempt to transcribe Cyrillic text into Hungarian script for a large dataset of WWII prisoner-of-war records.	23