OCR_GS_Data

OCR Training Data

Provides double-checked gold standard data for training and testing optical character recognition (OCR) engines.

GitHub

15 stars
8 watching
10 forks
last commit: about 2 years ago
Linked from 1 awesome list


Related projects:

Repository | Description | Stars
openarabic/ocr_gs_data | A collection of double-checked gold standard data for training and testing OCR engines | 13
igobronidze/hrs_training_data | Training data for a handwritten recognition system | 21
chreul/ocr_testdata_earlyprintedbooks | Provides test data and models for training Optical Character Recognition (OCR) systems on historical printed books | 10
ibm/max-ocr | An optical character recognition system deployed as a web service using a trained Tesseract OCR model | 47
scut-dlvclab/gpt-4v_ocr | Evaluates the Optical Character Recognition capabilities of GPT-4V(ision) using various tasks and scenarios to identify its strengths and weaknesses | 121
openphilology/tei-ocr | Customizes TEI XML for metadata from OCR processes to capture detailed layout and content information | 1
openphilology/nidaba | Automates OCR pipeline for text digitization and conversion of raw images into citable texts | 86
okgodoit/openai-api-dotnet | An unofficial .NET wrapper around OpenAI's GPT-3 API for accessing various text and vision processing services | 1,870
odot-pts/gtfs-ride | Defines an open standard for storing and sharing fixed-route transit ridership data | 49
kscanne/tesseract-gle-uncial | Provides training data and scripts to enhance OCR accuracy for Irish Gaelic fonts | 4
ocr4all/ocr4all | Provides OCR services for historical documents through an intuitive web interface | 244
hamdikahloun/windows_ocr | An OCR library allowing developers to embed high-quality character recognition functionality in their products | 18
thomas-george-t/thomas-george-t | A GitHub profile showcasing a data engineer's skills and interests | 75
jiwei0921/rgbd-sod-datasets | A collection of RGB-D Saliency Datasets and evaluation metrics for salient object detection | 64