pagexml

Document annotator library

A C++ and Python library for working with annotated document formats

Library in C++ and a python wrapper for dealing with Page XML files

GitHub

13 stars
6 watching
2 forks
Language: C++
last commit: 3 months ago
Linked from 1 awesome list

annotation-processingdocker-imagedocument-representationpagexmlpython

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
lxml/lxml-stubs Provides type annotations for a specific Python package to support static analysis and code completion tools 45
pyexcel/pyexcel An API for reading and manipulating data in various spreadsheet formats 1,221
synyi/poplar A web-based annotation tool for natural language processing (NLP) 520
michaelbromley/ngx-pagination A simple pagination solution for Angular applications 1,233
prima-research-lab/page-xml Provides an XML format for representing document image page content and layout information. 66
jflarvoire/libxml2 An XML toolkit with added support for Simplified XML (SML) parsing and generation. 3
dncuug/x.pagedlist A library for managing paginated data in ASP.NET applications 910
accelerationnet/cl-mediawiki A Common Lisp wrapper around the MediaWiki API to facilitate interaction with MediaWiki servers. 18
closedxml/closedxml.extensions.webapi Web API extensions for working with ClosedXML files 34
jveitchmichaelis/deeplabel A cross-platform tool for annotating images with labelled bounding boxes 209
lzx1413/labelimgplus An image annotation tool supporting various modes and formats, including CLS, DET, SEG, and PASCAL VOC. 211
dwd/rapidxml A C++ library for parsing and working with XML data structures 152
omnedia/ngx-word-pullup An Angular component library providing a smooth animation effect for sequentially displaying words 0
xyntopia/pydoxtools A Python library for extracting information from unstructured documents using AI techniques and customizable pipelines. 78
openphilology/tei-ocr Customizes TEI XML for metadata from OCR processes to capture detailed layout and content information 1