JsoupXpath

HTML parser with XPath support

A Java-based HTML parser that implements the W3C XPath 1.0 standard syntax.

A pure-Java HTML parser that supports the W3C XPath 1.0 standard syntax, built on Jsoup and ANTLR4. It may be the best of its kind in Java. Just try it.
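To give a feel for the kind of XPath 1.0 expressions such a parser evaluates, here is a minimal sketch. It deliberately uses the JDK's built-in `javax.xml.xpath` API rather than JsoupXpath's own classes, so it only handles well-formed markup and says nothing about how JsoupXpath is actually called. The class name `XPathSketch`, the helper `select`, and the sample page are hypothetical.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.xpath.XPath;
import javax.xml.xpath.XPathConstants;
import javax.xml.xpath.XPathFactory;
import org.w3c.dom.Document;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical illustration class; not part of JsoupXpath.
public class XPathSketch {

    // Evaluates an XPath 1.0 expression against well-formed markup and
    // returns the text content of every matched node, in document order.
    static List<String> select(String markup, String expression) throws Exception {
        Document doc = DocumentBuilderFactory.newInstance()
                .newDocumentBuilder()
                .parse(new InputSource(new StringReader(markup)));
        XPath xpath = XPathFactory.newInstance().newXPath();
        NodeList nodes = (NodeList) xpath.evaluate(expression, doc, XPathConstants.NODESET);

        List<String> values = new ArrayList<>();
        for (int i = 0; i < nodes.getLength(); i++) {
            values.add(nodes.item(i).getTextContent());
        }
        return values;
    }

    public static void main(String[] args) throws Exception {
        // Assumed sample input: a tiny, well-formed page with two links.
        String page = "<html><body>"
                + "<a class=\"repo\" href=\"/a\">first</a>"
                + "<a href=\"/b\">second</a>"
                + "</body></html>";

        // Attribute axis: the href value of every link.
        System.out.println(select(page, "//a/@href"));                 // prints [/a, /b]
        // Predicate on an attribute, then the link's text node.
        System.out.println(select(page, "//a[@class='repo']/text()")); // prints [first]
    }
}
```

The JDK parser rejects markup that is not well-formed XML. Tolerating real-world HTML is the part a Jsoup-based parser is meant to handle.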

GitHub

453 stars
21 watching
154 forks
Language: HTML
last commit: about 2 months ago
Topics: antlr4, html-parser, jsoupxpath, xpath

Related projects:

Repository | Description | Stars
antchfx/htmlquery | A Golang package for extracting data from HTML documents using XPath expressions. | 744
cezheng/fuzi | An XML and HTML parser for Swift with XPath and CSS support. | 1,082
yahoojapan/swiftyxmlparser | An XML parsing library implemented in Swift. | 584
lexborisov/myhtml | A fast HTML parsing library written in C. | 1,657
technosophos/querypath | A PHP library for manipulating XML and HTML documents, supporting various input formats and offering robust functionality through chaining. | 823
zhgchgli/zmarkupparser | A pure-Swift library for converting HTML strings into attributed strings with customizable styles and tags. | 321
pharo-contributions/xml-xmlparserhtml | Provides tools to parse HTML and convert it into well-formed XML, enabling web scraping with XPath. | 6
drmohundro/swxmlhash | A Swift wrapper around XML parsing APIs, providing a simple way to parse XML into a dictionary of arrays. | 1,412
imangazaliev/didom | A fast and simple HTML parser with support for CSS selectors and XPath expressions. | 2,202
chenyunguimilook/swiftyxml | A Swift library that provides a simple and Swifty way to parse and manipulate XML data. | 103
antchfx/xmlquery | An XML query package providing a simple and efficient way to extract data from XML documents using XPath expressions. | 451
sharplispers/xpath | An implementation of the XML Path Language (XPath) for querying and manipulating XML documents. | 8
phpgt/cssxpath | Translates CSS selectors to XPath queries for use with PHP's DOM functionality. | 20
serpapi/nokolexbor | A high-performance HTML5 parser for Ruby based on Lexbor with support for CSS selectors and XPath. | 327
mischov/meeseeks | A parser and extractor for HTML and XML data with CSS or XPath selectors. | 316