alto-tools

ALTO file processor

A set of Python tools for extracting and processing data from ALTO XML files
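
As a rough illustration of the kind of extraction described above, the following sketch pulls plain text lines out of an ALTO document using only Python's standard library. It does not use or reproduce the alto-tools API; the names `SAMPLE_ALTO`, `local_name`, and `extract_lines` are hypothetical, and the embedded sample is a made-up minimal document that assumes the ALTO v3 namespace.

```python
import xml.etree.ElementTree as ET

# Hypothetical minimal ALTO document, for illustration only.
SAMPLE_ALTO = """<alto xmlns="http://www.loc.gov/standards/alto/ns-v3#">
  <Layout>
    <Page ID="p1">
      <PrintSpace>
        <TextBlock ID="b1">
          <TextLine ID="l1">
            <String CONTENT="Hello"/>
            <SP/>
            <String CONTENT="ALTO"/>
          </TextLine>
          <TextLine ID="l2">
            <String CONTENT="world"/>
          </TextLine>
        </TextBlock>
      </PrintSpace>
    </Page>
  </Layout>
</alto>
"""


def local_name(tag):
    """Strip the '{namespace}' prefix so the code works across ALTO versions."""
    return tag.rsplit("}", 1)[-1]


def extract_lines(alto_xml):
    """Return one string per TextLine, joining its String CONTENT values."""
    root = ET.fromstring(alto_xml)
    lines = []
    for elem in root.iter():
        if local_name(elem.tag) != "TextLine":
            continue
        words = [
            child.attrib["CONTENT"]
            for child in elem
            if local_name(child.tag) == "String" and "CONTENT" in child.attrib
        ]
        lines.append(" ".join(words))
    return lines


if __name__ == "__main__":
    for line in extract_lines(SAMPLE_ALTO):
        print(line)
```

Matching on local element names rather than a fixed namespace is a design choice made here so that the same sketch would tolerate files written against different ALTO schema versions.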



39 stars
3 watching
15 forks
Language: Python
Last commit: about 1 year ago
Linked from 1 awesome list

alto-xml, digital-library, optical-character-recognition


Related projects:

| Repository | Description | Stars |
|---|---|---|
| cneud/ocr-conversion | A collection of scripts and stylesheets for converting data between different OCR formats | 71 |
| altoxml/documentation | Provides documentation and support materials for working with ALTO XML files | 39 |
| open-eo/openeo_odc_driver | A Python-based processing engine for OpenEO datacube operations | 7 |
| altoxml/schema | Repository containing official and draft ALTO XML schema versions, with documentation on proposed changes and updates | 51 |
| ioannad/asd-graph | Tools for visualizing Common Lisp system dependencies | 3 |
| code-kern-ai/refinery | A tool to help data scientists manage and annotate natural language data for training AI models | 1,402 |
| dr-leo/pandasdmx | Provides tools to access and manipulate SDMX-compliant data in various formats | 127 |
| achavignon/pala | A MATLAB project providing tools and functions for ultrasound localization microscopy research | 62 |
| patois/xray | Tool for filtering and highlighting decompiler output based on regular expressions | 125 |
| kinwaicheuk/nnaudio | An audio processing toolkit using PyTorch convolutional neural networks to generate spectrograms from raw audio data | 1,032 |
| raine/ramda-cli | A tool for composing functions into data-processing pipelines to produce desired output | 573 |
| prosch88/ufade | Automates the acquisition and backup of data from Apple devices | 156 |
| observingclouds/pysonde | A tool for converting and post-processing atmospheric sounding data from radiosonde files to netCDF format | 8 |
| augustinmortier/a-profiles | A Python library for reading and processing airborne lidar and ceilometer data | 11 |
| ctornau/latex | Automates LaTeX document processing with Docker-based CI/CD pipelines | 1 |