OCR_GS_Data

OCR Training Data

Provides gold standard data for training and testing optical character recognition (OCR) engines.

Double-checked Gold Standard Data for Training and Testing OCR Engines

GitHub

15 stars
8 watching
10 forks
last commit: almost 2 years ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
openarabic/ocr_gs_data A collection of double-checked gold standard data for training and testing OCR engines. 13
igobronidze/hrs_training_data Training data for a handwritten recognition system 20
chreul/ocr_testdata_earlyprintedbooks Provides test data and models for training Optical Character Recognition (OCR) systems on historical printed books. 10
ibm/max-ocr An optical character recognition system deployed as a web service using a trained Tesseract OCR model 47
scut-dlvclab/gpt-4v_ocr Evaluates the Optical Character Recognition capabilities of GPT-4V(ision) using various tasks and scenarios to identify its strengths and weaknesses 120
openphilology/tei-ocr Customizes TEI XML for metadata from OCR processes to capture detailed layout and content information 1
openphilology/nidaba Automates OCR pipeline for text digitization and conversion of raw images into citable texts. 86
okgodoit/openai-api-dotnet An unofficial .NET wrapper around OpenAI's GPT-3 API for accessing various text and vision processing services 1,860
odot-pts/gtfs-ride Defines an open standard for storing and sharing fixed-route transit ridership data. 49
kscanne/tesseract-gle-uncial Provides training data and scripts to enhance OCR accuracy for Irish Gaelic fonts 3
ocr4all/ocr4all Provides a platform for converting historical printed materials into editable digital text 238
hamdikahloun/windows_ocr An OCR library allowing developers to embed high-quality character recognition functionality in their products. 18
thomas-george-t/thomas-george-t A GitHub profile showcasing a data engineer's skills and interests 74
jiwei0921/rgbd-sod-datasets A collection of pre-processed RGB-D saliency detection datasets 63