pagexml

Document annotator library

A C++ and Python library for working with annotated document formats

A C++ library with a Python wrapper for working with Page XML files


13 stars
6 watching
2 forks
Language: C++
Last commit: about 1 month ago
Linked from 1 awesome list

Topics: annotation-processing, docker-image, document-representation, pagexml, python


Related projects:

Repository | Description | Stars
lxml/lxml-stubs | Provides type annotations for a specific Python package to support static analysis and code completion tools | 43
pyexcel/pyexcel | A unified API for reading and manipulating various spreadsheet file formats | 1,215
synyi/poplar | A web-based annotation tool for natural language processing (NLP) | 519
michaelbromley/ngx-pagination | A simple pagination solution for Angular applications | 1,232
prima-research-lab/page-xml | Provides an XML format for representing document image page content and layout information | 66
jflarvoire/libxml2 | An XML toolkit with added support for Simplified XML (SML) parsing and generation | 3
dncuug/x.pagedlist | A library for managing paginated data in ASP.NET applications | 902
accelerationnet/cl-mediawiki | A Common Lisp wrapper around the MediaWiki API to facilitate interaction with MediaWiki servers | 18
closedxml/closedxml.extensions.webapi | Web API extensions for working with ClosedXML files | 34
jveitchmichaelis/deeplabel | A cross-platform tool for annotating images with labelled bounding boxes | 209
lzx1413/labelimgplus | An image annotation tool supporting various modes and formats, including CLS, DET, SEG, and PASCAL VOC | 211
dwd/rapidxml | A C++ library for parsing and working with XML data structures | 152
omnedia/ngx-word-pullup | A component library for animating words in Angular applications | 0
xyntopia/pydoxtools | A Python library for extracting information from unstructured documents using AI techniques and customizable pipelines | 77
openphilology/tei-ocr | Customizes TEI XML for metadata from OCR processes to capture detailed layout and content information | 1