doctr
OCR engine
A deep learning-based OCR library that enables efficient text parsing and recognition from documents
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
4k stars
43 watching
453 forks
Language: Python
last commit: 2 months ago
Linked from 1 awesome list
deep-learningdocument-recognitionocroptical-character-recognitionpytorchtensorflow2text-detectiontext-detection-recognitiontext-recognition
Related projects:
Repository | Description | Stars |
---|---|---|
| An integrated framework for document AI tasks using deep learning models. | 2,628 |
| A tool that adds OCR text to scanned PDF files, allowing them to be searchable and copy-pasted. | 14,363 |
| An OCR engine capable of recognizing text in images from various languages and formats. | 63,142 |
| A Python library for representing, transmitting, storing, and retrieving multimodal data | 2,998 |
| A TensorFlow model for recognizing text in images using visual attention and a sequence-to-sequence architecture. | 1,079 |
| A collection of tools for document analysis and OCR. | 3,426 |
| Provides a benchmarking framework and implementation for deep learning-based text recognition models | 3,769 |
| An implementation of a high-resolution neural network architecture for semantic segmentation tasks. | 3,178 |
| An NLP project offering various text classification models and techniques for deep learning exploration | 7,881 |
| An annotation tool for machine learning practitioners to create labeled datasets | 9,645 |
| An optical character recognition system deployed as a web service using a trained Tesseract OCR model | 47 |
| A large language model designed to understand documents without OCR, focusing on document structure and content analysis. | 1,958 |
| A collection of pre-trained machine learning models for various natural language and computer vision tasks, enabling developers to fine-tune and deploy these models on their own projects. | 136,357 |
| Converts PDF documents to text formats with high accuracy and support for various document types | 18,618 |
| An OCR system optimized for historical and non-Latin scripts, providing layout analysis, character recognition, and support for various formats. | 757 |