ocr-conversion

OCR converter toolkit

A collection of scripts and stylesheets for converting data between different OCR formats.

Conversions between various OCR formats

72 stars
4 watching
3 forks
last commit: over 1 year ago
Linked from 1 awesome list

abbyy-xml, alto-xml, hocr, ocr, page-xml, tei-xml

Related projects:

| Repository | Description | Stars |
| --- | --- | --- |
| ub-mannheim/ocr-fileformat | Tool for converting and validating OCR file formats | 182 |
| openphilology/tei-ocr | Customizes TEI XML for metadata from OCR processes to capture detailed layout and content information | 1 |
| ocropus/hocr-tools | Tools for manipulating and analyzing multi-lingual OCR results by representing them in a standard HTML format | 373 |
| cjodo/convert.nvim | A plugin to perform unit conversions for CSS and other units in design work | 46 |
| bsoyka/advent-of-code-ocr | A tool for converting ASCII art images to plain text characters | 13 |
| dbuenzli/uucd | Decodes data from Unicode character database XML representation | 17 |
| tesseract-ocr/docs | A collection of documents detailing various aspects and improvements to the Tesseract OCR engine | 262 |
| chreul/ocr_testdata_earlyprintedbooks | Provides test data and models for training Optical Character Recognition (OCR) systems on historical printed books | 10 |
| dannnylo/rtesseract | A Ruby library providing an interface to the Tesseract OCR system | 838 |
| input-output-hk/nix-tools | A tool to convert descriptions from one format to another | 95 |
| kolpakov-p/zod-to-nestjs-graphql | A package to generate GraphQL types from zod contracts in TypeScript | 20 |
| manisandro/gimagereader | A software tool that enables the conversion of images and documents into editable text using OCR technology | 1,653 |
| baskaufs/guid-o-matic | Software to convert fielded text files to RDF serialized in different formats | 12 |
| laurentmazare/npy-ocaml | A library that allows OCaml bigarrays to be written and read in the NumPy file format | 41 |
| ironymark/abbyytoalto | Converts Abbyy FineReader XML to ALTO XML format | 9 |