hocr-spec
OCR format
A specification for an embedded OCR workflow and output format
The hOCR Embedded OCR Workflow and Output Format
74 stars
13 watching
20 forks
Language: HTML
last commit: about 1 year ago
Linked from 1 awesome list
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | Tools for manipulating and analyzing multi-lingual OCR results by representing them in a standard HTML format | 373 |
| | Tool for converting and validating OCR file formats | 182 |
| | An OCR system built into a Docker container to perform text recognition on images. | 9 |
| | An OCR library allowing developers to embed high-quality character recognition functionality in their products. | 18 |
| | A containerized implementation of OCR software for document recognition | 2 |
| | Provides OCR services for historical documents through an intuitive web interface | 244 |
| | A Docker container for running the Kraken OCR engine | 5 |
| | An OCR system optimized for historical and non-Latin scripts, providing layout analysis, character recognition, and support for various formats. | 757 |
| | Utilities to process and transform hOCR files into ALTO format using XSLT transformations | 6 |
| | An optical character recognition system deployed as a web service using a trained Tesseract OCR model | 47 |
| | An Object Pascal binding for the Tesseract OCR engine to perform optical character recognition | 145 |
| | A C library for parsing and generating CBOR data format | 347 |
| | A collection of documents detailing various aspects and improvements to the Tesseract OCR engine | 262 |
| | A Ruby wrapper around the Tesseract OCR API to provide an easy-to-use interface for optical character recognition tasks | 629 |