OCR_GS_Data
OCR Training Data
Provides gold standard data for training and testing optical character recognition (OCR) engines.
Double-checked Gold Standard Data for Training and Testing OCR Engines
15 stars
8 watching
10 forks
last commit: about 2 years ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
| A collection of double-checked gold standard data for training and testing OCR engines. | 13 |
| Training data for a handwritten recognition system | 21 |
| Provides test data and models for training Optical Character Recognition (OCR) systems on historical printed books. | 10 |
| An optical character recognition system deployed as a web service using a trained Tesseract OCR model | 47 |
| Evaluates the Optical Character Recognition capabilities of GPT-4V(ision) using various tasks and scenarios to identify its strengths and weaknesses | 121 |
| Customizes TEI XML for metadata from OCR processes to capture detailed layout and content information | 1 |
| Automates OCR pipeline for text digitization and conversion of raw images into citable texts. | 86 |
| An unofficial .NET wrapper around OpenAI's GPT-3 API for accessing various text and vision processing services | 1,870 |
| Defines an open standard for storing and sharing fixed-route transit ridership data. | 49 |
| Provides training data and scripts to enhance OCR accuracy for Irish Gaelic fonts | 4 |
| Provides OCR services for historical documents through an intuitive web interface | 244 |
| An OCR library allowing developers to embed high-quality character recognition functionality in their products. | 18 |
| A GitHub profile showcasing a data engineer's skills and interests | 75 |
| A collection of RGB-D Saliency Datasets and evaluation metrics for salient object detection | 64 |