doctr

OCR engine

A deep learning-based OCR library that enables efficient text parsing and recognition from documents

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

GitHub

4k stars

43 watching

453 forks

Language: Python

last commit: over 1 year ago

Linked from 1 awesome list

deep-learningdocument-recognitionocroptical-character-recognitionpytorchtensorflow2text-detectiontext-detection-recognitiontext-recognition

mindee.github.io/doctr/

Backlinks from these awesome lists:

kba/awesome-ocr

Related projects:

Repository	Description	Stars
deepdoctection/deepdoctection	An integrated framework for document AI tasks using deep learning models.	2,628
ocrmypdf/ocrmypdf	A tool that adds OCR text to scanned PDF files, allowing them to be searchable and copy-pasted.	14,363
tesseract-ocr/tesseract	An OCR engine capable of recognizing text in images from various languages and formats.	63,142
docarray/docarray	A Python library for representing, transmitting, storing, and retrieving multimodal data	2,998
emedvedev/attention-ocr	A TensorFlow model for recognizing text in images using visual attention and a sequence-to-sequence architecture.	1,079
ocropus-archive/dup-ocropy	A collection of tools for document analysis and OCR.	3,426
clovaai/deep-text-recognition-benchmark	Provides a benchmarking framework and implementation for deep learning-based text recognition models	3,769
hrnet/hrnet-semantic-segmentation	An implementation of a high-resolution neural network architecture for semantic segmentation tasks.	3,178
brightmart/text_classification	An NLP project offering various text classification models and techniques for deep learning exploration	7,881
doccano/doccano	An annotation tool for machine learning practitioners to create labeled datasets	9,645
ibm/max-ocr	An optical character recognition system deployed as a web service using a trained Tesseract OCR model	47
x-plug/mplug-docowl	A large language model designed to understand documents without OCR, focusing on document structure and content analysis.	1,958
huggingface/transformers	A collection of pre-trained machine learning models for various natural language and computer vision tasks, enabling developers to fine-tune and deploy these models on their own projects.	136,357
vikparuchuri/marker	Converts PDF documents to text formats with high accuracy and support for various document types	18,618
mittagessen/kraken	An OCR system optimized for historical and non-Latin scripts, providing layout analysis, character recognition, and support for various formats.	757