hocr-spec
OCR format
A specification for an embedded OCR workflow and output format
The hOCR Embedded OCR Workflow and Output Format
74 stars
13 watching
20 forks
Language: HTML
last commit: 8 months ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
| Tools for manipulating and analyzing multi-lingual OCR results by representing them in a standard HTML format | 373 |
| Tool for converting and validating OCR file formats | 182 |
| An OCR system built into a Docker container to perform text recognition on images. | 9 |
| An OCR library allowing developers to embed high-quality character recognition functionality in their products. | 18 |
| A containerized implementation of OCR software for document recognition | 2 |
| Provides OCR services for historical documents through an intuitive web interface | 244 |
| A Docker container for running the Kraken OCR engine | 5 |
| An OCR system optimized for historical and non-Latin scripts, providing layout analysis, character recognition, and support for various formats. | 757 |
| Utilities to process and transform hOCR files into ALTO format using XSLT transformations | 6 |
| An optical character recognition system deployed as a web service using a trained Tesseract OCR model | 47 |
| An Object Pascal binding for the Tesseract OCR engine to perform optical character recognition | 145 |
| A C library for parsing and generating CBOR data format | 347 |
| A collection of documents detailing various aspects and improvements to the Tesseract OCR engine | 262 |
| A Ruby wrapper around the Tesseract OCR API to provide an easy-to-use interface for optical character recognition tasks | 629 |