pagexml

Document annotator library

A C++ and Python library for working with annotated document formats

Library in C++ and a python wrapper for dealing with Page XML files

GitHub

13 stars

6 watching

2 forks

Language: C++

last commit: 9 months ago

Linked from 1 awesome list

annotation-processingdocker-imagedocument-representationpagexmlpython

Backlinks from these awesome lists:

kba/awesome-ocr

Related projects:

Repository	Description	Stars
lxml/lxml-stubs	Provides type annotations for a specific Python package to support static analysis and code completion tools	45
pyexcel/pyexcel	An API for reading and manipulating data in various spreadsheet formats	1,221
synyi/poplar	A web-based annotation tool for natural language processing (NLP)	520
michaelbromley/ngx-pagination	A simple pagination solution for Angular applications	1,233
prima-research-lab/page-xml	Provides an XML format for representing document image page content and layout information.	66
jflarvoire/libxml2	An XML toolkit with added support for Simplified XML (SML) parsing and generation.	3
dncuug/x.pagedlist	A library for managing paginated data in ASP.NET applications	910
accelerationnet/cl-mediawiki	A Common Lisp wrapper around the MediaWiki API to facilitate interaction with MediaWiki servers.	18
closedxml/closedxml.extensions.webapi	Web API extensions for working with ClosedXML files	34
jveitchmichaelis/deeplabel	A cross-platform tool for annotating images with labelled bounding boxes	209
lzx1413/labelimgplus	An image annotation tool supporting various modes and formats, including CLS, DET, SEG, and PASCAL VOC.	211
dwd/rapidxml	A C++ library for parsing and working with XML data structures	152
omnedia/ngx-word-pullup	An Angular component library providing a smooth animation effect for sequentially displaying words	0
xyntopia/pydoxtools	A Python library for extracting information from unstructured documents using AI techniques and customizable pipelines.	78
openphilology/tei-ocr	Customizes TEI XML for metadata from OCR processes to capture detailed layout and content information	1