doctr

OCR engine

A deep learning-based OCR library that enables efficient text parsing and recognition from documents.

docTR (Document Text Recognition): a seamless, high-performing, and accessible library for OCR-related tasks, powered by deep learning.

GitHub

4k stars
43 watching
453 forks
Language: Python
Last commit: about 1 month ago
Linked from 1 awesome list

Topics: deep-learning, document-recognition, ocr, optical-character-recognition, pytorch, tensorflow2, text-detection, text-detection-recognition, text-recognition

Related projects:

| Repository | Description | Stars |
| --- | --- | --- |
| deepdoctection/deepdoctection | An integrated framework for document AI tasks using deep learning models. | 2,628 |
| ocrmypdf/ocrmypdf | A tool that adds OCR text to scanned PDF files, allowing them to be searchable and copy-pasted. | 14,363 |
| tesseract-ocr/tesseract | An OCR engine capable of recognizing text in images from various languages and formats. | 63,142 |
| docarray/docarray | A Python library for representing, transmitting, storing, and retrieving multimodal data | 2,998 |
| emedvedev/attention-ocr | A TensorFlow model for recognizing text in images using visual attention and a sequence-to-sequence architecture. | 1,079 |
| ocropus-archive/dup-ocropy | A collection of tools for document analysis and OCR. | 3,426 |
| clovaai/deep-text-recognition-benchmark | Provides a benchmarking framework and implementation for deep learning-based text recognition models | 3,769 |
| hrnet/hrnet-semantic-segmentation | An implementation of a high-resolution neural network architecture for semantic segmentation tasks. | 3,178 |
| brightmart/text_classification | An NLP project offering various text classification models and techniques for deep learning exploration | 7,881 |
| doccano/doccano | An annotation tool for machine learning practitioners to create labeled datasets | 9,645 |
| ibm/max-ocr | An optical character recognition system deployed as a web service using a trained Tesseract OCR model | 47 |
| x-plug/mplug-docowl | A large language model designed to understand documents without OCR, focusing on document structure and content analysis. | 1,958 |
| huggingface/transformers | A collection of pre-trained machine learning models for various natural language and computer vision tasks, enabling developers to fine-tune and deploy these models on their own projects. | 136,357 |
| vikparuchuri/marker | Converts PDF documents to text formats with high accuracy and support for various document types | 18,618 |
| mittagessen/kraken | An OCR system optimized for historical and non-Latin scripts, providing layout analysis, character recognition, and support for various formats. | 757 |