OCR_GS_Data

OCR dataset

A collection of double-checked gold standard data for training and testing OCR engines.

Double-checked Gold Standard Data for Training and Testing OCR Engines

GitHub

13 stars
5 watching
14 forks
Language: HTML
last commit: over 7 years ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
openiti/ocr_gs_data Provides gold standard data for training and testing optical character recognition (OCR) engines. 15
chreul/ocr_testdata_earlyprintedbooks Provides test data and models for training Optical Character Recognition (OCR) systems on historical printed books. 10
oscarmcnulty/gta-3d-dataset A dataset of 2D images and 3D data generated from the Grand Theft Auto game engine for object localization research. 134
osdg-ai/osdg-data A dataset of human-labeled text excerpts validated against the Sustainable Development Goals. 28
tleyden/open-ocr An OCR-as-a-Service using Tesseract and Docker with scalable architecture and support for multiple languages. 1,342
opengeos/geospatial-data-catalogs Compiles lists of publicly available geospatial datasets from various cloud platforms. 526
ospector/gtest-gbar A C# wrapper around Google Test to enhance its user interface 131
open-sdg/open-sdg A platform for collecting and disseminating data for global sustainability indicators 62
gopherdata/resources A collection of Go-based resources and tools for data science tasks 876
igobronidze/hrs_training_data Training data for a handwritten recognition system 20
pedrobarcha/old-books-dataset A collection of scanned book pages with ground truth annotations for OCR research and text analysis 12
pku-yuangroup/open-sora-dataset A large video dataset collected from various open-source websites for use in computer vision and multimedia applications. 94
antimatter15/gocr.js A JavaScript OCR engine using Emscripten compiled C code 98
hamdikahloun/windows_ocr An OCR library allowing developers to embed high-quality character recognition functionality in their products. 18
openseg-group/openseg.pytorch Provides a PyTorch implementation of several computer vision tasks including object detection, segmentation and parsing. 1,190