GOT-OCR2.0

OCR model

A Python implementation of a unified end-to-end OCR model based on the General OCR Theory, supporting various formats and fine-grained recognition.

Official code implementation of the paper "General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model".

GitHub

6k stars
53 watching
515 forks
Language: Python
Last commit: 6 days ago

Related projects:

| Repository | Description | Stars |
| --- | --- | --- |
| tesseract-ocr/tesseract | An OCR engine capable of recognizing text in images from various languages and formats. | 62,363 |
| jaidedai/easyocr | An OCR system that supports multiple languages and writing scripts. | 24,528 |
| ofa-sys/ofa | Develops a unified sequence-to-sequence learning framework to unify modalities and tasks through pretraining and fine-tuning. | 2,419 |
| openiti/ocr_gs_data | Provides gold standard data for training and testing optical character recognition (OCR) engines. | 15 |
| chreul/ocr_testdata_earlyprintedbooks | Provides test data and models for training Optical Character Recognition (OCR) systems on historical printed books. | 10 |
| hrnet/hrnet-semantic-segmentation | An implementation of a high-resolution neural network architecture for semantic segmentation tasks. | 3,156 |
| ryanfb/ancientgreekocr-ocr-evaluation-tools | A collection of tools and scripts to evaluate the accuracy of Optical Character Recognition (OCR) systems. | 22 |
| kwai-kolors/kolors | A Python framework for training and deploying photorealistic text-to-image synthesis models. | 3,862 |
| otiai10/gosseract | An OCR package using the Tesseract C++ library to extract text from images. | 2,718 |
| openarabic/ocr_gs_data | A collection of double-checked gold standard data for training and testing OCR engines. | 13 |
| scut-dlvclab/gpt-4v_ocr | Evaluates the Optical Character Recognition capabilities of GPT-4V(ision) using various tasks and scenarios to identify its strengths and weaknesses. | 120 |
| emedvedev/attention-ocr | A TensorFlow model for recognizing text in images using visual attention and a sequence-to-sequence architecture. | 1,077 |
| ibm/max-ocr | An optical character recognition system deployed as a web service using a trained Tesseract OCR model. | 47 |
| oeg-upm/gtfs-bench | Provides a benchmarking framework for evaluating declarative knowledge graph construction engines in the transport domain. | 17 |
| ub-mannheim/ocr-fileformat | Tool for converting and validating OCR file formats. | 180 |