hOCRTools

hOCR transformer

Utilities to process and transform hOCR files into ALTO format using XSLT transformations

Utilities to process and handle hOCR

GitHub

6 stars
7 watching
0 forks
Language: XSLT
last commit: over 6 years ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
ub-mannheim/ocr-fileformat Tool for converting and validating OCR file formats 180
ocropus/hocr-tools Tools for manipulating and analyzing multi-lingual OCR results by representing them in a standard HTML format 370
kba/hocr-spec A specification for an embedded OCR workflow and output format 74
sillsdev/odtxslt A C# library that performs XSLT transformations on package contents 2
svenskaspel/har2locust Automatically converts browser recordings (.har files) into locust scripts. 160
athento/hocr-parser A Python library for parsing the HOCR specification into structured data 13
syntax-tree/hast-util-from-text Utility to transform plain text into a hast node's visible content 2
apache/xalan-c A C++ library for transforming XML documents using XSLT 1.0 standards 29
chrdebru/r2rml An implementation of an R2RML processor for transforming relational databases into RDF data 30
flavorjones/loofah A Ruby library that provides tools for transforming and sanitizing HTML documents and fragments 935
isl/x3ml Engine for transforming XML data into RDF format based on predefined mappings and policies 20
sl1pm4t/k2tf Converts Kubernetes API objects to Terraform configuration language 1,191
mandrean/har-rs A library for serializing and deserializing the HTTP Archive format 44
auteon/puppeteer-har Tools to capture and generate HAR files from Puppeteer browser interactions 10
micahhausler/container-transform Transforms container configurations between various formats 1,411