DUP-ocropy

Document analyzer

A collection of tools for document analysis and OCR.

Python-based tools for document analysis and OCR

GitHub

3k stars
205 watching
591 forks
Language: Jupyter Notebook
last commit: over 3 years ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
ocrmypdf/ocrmypdf A tool that adds OCR text to scanned PDF files, allowing them to be searchable and copy-pasted. 14,140
ocropus/hocr-tools Tools for manipulating and analyzing multi-lingual OCR results by representing them in a standard HTML format 370
openphilology/nidaba Automates OCR pipeline for text digitization and conversion of raw images into citable texts. 86
mindee/doctr A deep learning-based OCR library that enables efficient text parsing and recognition from documents 3,859
openseg-group/openseg.pytorch Provides a PyTorch implementation of several computer vision tasks including object detection, segmentation and parsing. 1,190
kba/docker-ocropy An OCR system built into a Docker container to perform text recognition on images. 9
ml-tooling/opyrator Automates conversion of machine learning code into production-ready microservices with web API and GUI. 3,102
otiai10/gosseract An OCR package using Tesseract C++ library to extract text from images 2,718
qubvel-org/segmentation_models.pytorch A PyTorch library for building and training neural networks for image segmentation tasks. 9,696
oyam/pytorch-dpns PyTorch implementation of a deep learning model for image segmentation 90
ocramius/proxymanager Generates and manages proxies of objects to abstract away complex behavior 4,954
opendronemap/odm Toolkit for generating maps and 3D models from drone images 4,901
holoviz/holoviews A Python library that simplifies data analysis and visualization by annotating data instead of relying on manual plotting 2,707
tesseract-ocr/tesseract An OCR engine capable of recognizing text in images from various languages and formats. 62,363