toebler-ocr
Book Transcription Project
An OCR project using historical French book data to train models and generate transcriptions.
1 star
4 watching
0 forks
Language: HTML
Last commit: about 6 years ago
Linked from 1 awesome list
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | Provides test data and models for training Optical Character Recognition (OCR) systems on historical printed books. | 10 |
| | A tool for transcribing OCR data from archival documents | 17 |
| | An OCR library allowing developers to embed high-quality character recognition functionality in their products. | 18 |
| | A tool for processing Coq and Lean 4 code embedded in text documents | 237 |
| | A Ruby library providing an interface to the Tesseract OCR system. | 838 |
| | Develops models to transcribe handwritten text from Old French and Old Occitan medieval manuscripts | 0 |
| | An OCR system optimized for historical and non-Latin scripts, providing layout analysis, character recognition, and support for various formats. | 757 |
| | An OCR system designed to transcribe historical documents with high accuracy, handling various challenges such as font variation and code-switching. | 256 |
| | An OCR ground truth repository for Caroline Minuscule manuscripts. | 11 |
| | A web-based tool for editing and annotating OCR transcriptions of scanned text | 48 |
| | A wiki-like application for collaborative transcription of handwritten documents from scanned pages. | 171 |
| | Develops software for accurately transcribing piano recordings into MIDI files using machine learning models. | 1,676 |
| | A collection of scripts and stylesheets for converting data between different OCR formats. | 72 |
| | A project that creates a book on Jupyter, focusing on its capabilities and applications | 19 |
| | An attempt to transcribe Cyrillic text into Hungarian script for a large dataset of WWII prisoner-of-war records | 23 |