OCR_GS_Data
OCR Training Data
Provides gold standard data for training and testing optical character recognition (OCR) engines.
Double-checked Gold Standard Data for Training and Testing OCR Engines
15 stars
8 watching
10 forks
last commit: almost 2 years ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
openarabic/ocr_gs_data | A collection of double-checked gold standard data for training and testing OCR engines. | 13 |
igobronidze/hrs_training_data | Training data for a handwritten recognition system | 20 |
chreul/ocr_testdata_earlyprintedbooks | Provides test data and models for training Optical Character Recognition (OCR) systems on historical printed books. | 10 |
ibm/max-ocr | An optical character recognition system deployed as a web service using a trained Tesseract OCR model | 47 |
scut-dlvclab/gpt-4v_ocr | Evaluates the Optical Character Recognition capabilities of GPT-4V(ision) using various tasks and scenarios to identify its strengths and weaknesses | 120 |
openphilology/tei-ocr | Customizes TEI XML for metadata from OCR processes to capture detailed layout and content information | 1 |
openphilology/nidaba | Automates OCR pipeline for text digitization and conversion of raw images into citable texts. | 86 |
okgodoit/openai-api-dotnet | An unofficial .NET wrapper around OpenAI's GPT-3 API for accessing various text and vision processing services | 1,860 |
odot-pts/gtfs-ride | Defines an open standard for storing and sharing fixed-route transit ridership data. | 49 |
kscanne/tesseract-gle-uncial | Provides training data and scripts to enhance OCR accuracy for Irish Gaelic fonts | 3 |
ocr4all/ocr4all | Provides a platform for converting historical printed materials into editable digital text | 238 |
hamdikahloun/windows_ocr | An OCR library allowing developers to embed high-quality character recognition functionality in their products. | 18 |
thomas-george-t/thomas-george-t | A GitHub profile showcasing a data engineer's skills and interests | 74 |
jiwei0921/rgbd-sod-datasets | A collection of pre-processed RGB-D saliency detection datasets | 63 |