hocr-parser

HOCR parser

A Python library that parses hOCR documents (OCR output embedded in HTML) into structured data

HOCR Specification Python Parser
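
To illustrate what "parsing hOCR into structured data" can mean, here is a minimal sketch that uses only Python's standard library. It is not this project's API: the names `HocrWordParser`, `parse_bbox`, and `SAMPLE` are hypothetical and exist only for this example. It assumes the usual hOCR conventions, where each word is an element with class `ocrx_word` and its `title` attribute holds semicolon-separated properties such as `bbox x0 y0 x1 y1`.

```python
from html.parser import HTMLParser


def parse_bbox(title):
    """Return (x0, y0, x1, y1) from an hOCR title string, or None if absent."""
    # A title looks like "bbox 36 92 96 116; x_wconf 95".
    for prop in title.split(";"):
        parts = prop.split()
        if len(parts) == 5 and parts[0] == "bbox":
            return tuple(int(p) for p in parts[1:])
    return None


class HocrWordParser(HTMLParser):
    """Collect word text and bounding boxes from hOCR markup (illustrative only)."""

    def __init__(self):
        super().__init__()
        self.words = []        # list of {"text": str, "bbox": tuple or None}
        self._current = None   # word being read, if any
        self._tag = None       # tag name that opened the current word

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        classes = (attrs.get("class") or "").split()
        if "ocrx_word" in classes:
            self._current = {"text": "", "bbox": parse_bbox(attrs.get("title") or "")}
            self._tag = tag

    def handle_data(self, data):
        if self._current is not None:
            self._current["text"] += data

    def handle_endtag(self, tag):
        # Simplification: assumes a word element contains no nested element
        # with the same tag name.
        if self._current is not None and tag == self._tag:
            self._current["text"] = self._current["text"].strip()
            self.words.append(self._current)
            self._current = None
            self._tag = None


# Hypothetical hOCR fragment, made up for this example.
SAMPLE = """
<div class='ocr_page' title='bbox 0 0 600 800'>
  <span class='ocr_line' title='bbox 36 92 200 116'>
    <span class='ocrx_word' title='bbox 36 92 96 116; x_wconf 95'>Hello</span>
    <span class='ocrx_word' title='bbox 104 92 200 116; x_wconf 91'>world</span>
  </span>
</div>
"""

parser = HocrWordParser()
parser.feed(SAMPLE)
parser.close()
for word in parser.words:
    print(word["text"], word["bbox"])
```

A real parser would also model pages, paragraphs, and lines as nested objects and read other properties such as `x_wconf`; this sketch flattens everything to words to stay short.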

GitHub

13 stars
8 watching
8 forks
Language: Python
Last commit: about 9 years ago
Linked from 1 awesome list


Related projects:

| Repository | Description | Stars |
| --- | --- | --- |
| uriparser/uriparser | A C library for parsing and handling Uniform Resource Identifiers (URIs) in a strict RFC 3986 compliant manner | 336 |
| ocropus/hocr-tools | Tools for manipulating and analyzing multi-lingual OCR results by representing them in a standard HTML format | 370 |
| haskell/attoparsec | A fast Haskell parser combinator library for efficient text and binary data parsing | 513 |
| r1chardj0n3s/parse | A library that parses strings using a specification based on the Python format() syntax | 1,713 |
| h2o/picohttpparser | A lightweight HTTP request/response parser written in C | 1,851 |
| jturner314/py_literal | A Rust crate for parsing and formatting Python literals | 16 |
| jhumphry/parse_args | A package to parse command line arguments and options in Ada 2012 | 12 |
| henrypoydar/chronic_duration | An elapsed time parser for natural language inputs in Ruby | 351 |
| ehrhardt/infraero | Tool for extracting data from Infraero's Brazilian air traffic control department website | 19 |
| amplify-education/python-hcl2 | A Python parser for HCL2 configuration files used in Terraform and other tools | 255 |
| thephpleague/uri | A PHP library for manipulating and parsing URIs according to various standards | 1,034 |
| phillord/horned-owl | A Rust library for processing and manipulating OWL ontologies | 67 |
| aymerick/douceur | A tool for parsing and inlining CSS styles within HTML documents | 246 |
| neuralegion/har | A Crystal library for parsing the HTTP Archive format | 22 |
| ystero-dev/hampi | A toolkit for generating Rust bindings from ASN.1 specifications | 44 |