jsoup
HTML parser
A Java library for parsing and manipulating HTML, XML, and CSS
jsoup: the Java HTML parser, built for HTML editing, cleaning, scraping, and XSS safety.
11k stars
392 watching
2k forks
Language: Java
last commit: 3 months ago csscss-selectorsdomhtmljavajava-html-parserjsoupparserxmlxpath
Related projects:
Repository | Description | Stars |
---|---|---|
| A Java-based HTML parser implementing the W3C XPATH 1.0 standard syntax for XPath expressions. | 453 |
| A Java library that provides annotations to simplify HTML scraping and processing with Jsoup | 239 |
| A Haskell library that simplifies HTML parsing by providing CSS selectors and attribute extraction functions. | 123 |
| Automates web page scraping and text extraction to make any webpage readable | 343 |
| A fast and flexible HTML parser and DOM manipulator with jQuery-like API | 28,793 |
| A pure-JavaScript implementation of various web standards for use with Node.js | 20,630 |
| A command line tool for parsing and manipulating HTML | 8,185 |
| A Haskell library for parsing and extracting information from HTML/XML documents | 233 |
| A fast and simple HTML parser with support for CSS selectors and XPath expressions. | 2,202 |
| A fast and forgiving HTML parser with a focus on minimal allocations | 4,474 |
| A fast HTML parsing library written in C | 1,657 |
| An HTML parsing library that converts web pages to structured data and then generates Markdown content from that data | 1 |
| A plugin to inspect and manipulate URLs in HTML documents | 20 |
| A JavaScript library for adding search, sort, filters and flexibility to tables and lists in HTML elements. | 11,207 |
| A functional HTML scraping and manipulation library in OCaml | 384 |