OCR_GS_Data
OCR Training Data
Provides gold standard data for training and testing optical character recognition (OCR) engines.
Double-checked Gold Standard Data for Training and Testing OCR Engines
15 stars
8 watching
10 forks
last commit: almost 3 years ago
Linked from 1 awesome list
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | A collection of double-checked gold standard data for training and testing OCR engines. | 13 |
| | Training data for a handwritten recognition system | 21 |
| | Provides test data and models for training Optical Character Recognition (OCR) systems on historical printed books. | 10 |
| | An optical character recognition system deployed as a web service using a trained Tesseract OCR model | 47 |
| | Evaluates the Optical Character Recognition capabilities of GPT-4V(ision) using various tasks and scenarios to identify its strengths and weaknesses | 121 |
| | Customizes TEI XML for metadata from OCR processes to capture detailed layout and content information | 1 |
| | Automates OCR pipeline for text digitization and conversion of raw images into citable texts. | 86 |
| | An unofficial .NET wrapper around OpenAI's GPT-3 API for accessing various text and vision processing services | 1,870 |
| | Defines an open standard for storing and sharing fixed-route transit ridership data. | 49 |
| | Provides training data and scripts to enhance OCR accuracy for Irish Gaelic fonts | 4 |
| | Provides OCR services for historical documents through an intuitive web interface | 244 |
| | An OCR library allowing developers to embed high-quality character recognition functionality in their products. | 18 |
| | A GitHub profile showcasing a data engineer's skills and interests | 75 |
| | A collection of RGB-D Saliency Datasets and evaluation metrics for salient object detection | 64 |