DUP-ocropy

Document analyzer

A collection of tools for document analysis and OCR.

Python-based tools for document analysis and OCR

GitHub

3k stars
205 watching
593 forks
Language: Jupyter Notebook
last commit: over 3 years ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
ocrmypdf/ocrmypdf A tool that adds OCR text to scanned PDF files, allowing them to be searchable and copy-pasted. 14,363
ocropus/hocr-tools Tools for manipulating and analyzing multi-lingual OCR results by representing them in a standard HTML format 373
openphilology/nidaba Automates OCR pipeline for text digitization and conversion of raw images into citable texts. 86
mindee/doctr A deep learning-based OCR library that enables efficient text parsing and recognition from documents 4,011
openseg-group/openseg.pytorch Provides a PyTorch implementation of several computer vision tasks including object detection, segmentation and parsing. 1,191
kba/docker-ocropy An OCR system built into a Docker container to perform text recognition on images. 9
ml-tooling/opyrator Automates conversion of machine learning code into production-ready microservices with web API and GUI. 3,116
otiai10/gosseract An OCR package using Tesseract C++ library to extract text from images 2,751
qubvel-org/segmentation_models.pytorch A comprehensive library for training and applying deep learning models for image segmentation 9,829
oyam/pytorch-dpns PyTorch implementation of a deep learning model for image segmentation 90
ocramius/proxymanager Generates and manages proxy classes to abstract object behavior 4,956
opendronemap/odm Toolkit for generating maps and 3D models from drone images 4,944
holoviz/holoviews A Python library that simplifies data analysis and visualization by annotating data instead of relying on manual plotting 2,719
tesseract-ocr/tesseract An OCR engine capable of recognizing text in images from various languages and formats. 63,142