pup
HTML parser
A command line tool for parsing and manipulating HTML
Parsing HTML at the command line
8k stars
90 watching
259 forks
Language: HTML
last commit: 9 months ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
jhy/jsoup | A Java library for parsing and manipulating HTML, XML, and CSS | 10,985 |
scinfu/swiftsoup | A Swift library for parsing and manipulating HTML documents. | 4,586 |
terrier989/universal_html | A cross-platform Dart package for parsing and manipulating HTML, XML, and CSS documents across various platforms. | 0 |
cheeriojs/cheerio | A fast and flexible HTML parser and DOM manipulator with jQuery-like API | 28,793 |
gjtorikian/html-pipeline | A Ruby-based framework for transforming user content into HTML markup with chainable filters | 2,267 |
zzzprojects/html-agility-pack | An HTML parsing library that allows developers to parse and manipulate malformed HTML documents | 2,665 |
uncss/grunt-uncss | A task for removing unused CSS from projects | 3,824 |
jmcarp/robobrowser | A Python library for interacting with web pages without the need for a standalone browser | 3,703 |
fox-it/dissect.ole | A parser for the Object Linking & Embedding (OLE) format used by Windows document editors. | 3 |
mholt/papaparse | A fast and reliable CSV parser for JavaScript applications | 12,621 |
bevacqua/woofmark | Rich text editing library for HTML and Markdown | 1,628 |
lexborisov/myhtml | A fast HTML parsing library written in C | 1,657 |
symfony/panther | A library that automates browser testing and web scraping using real browsers and the WebDriver protocol. | 2,955 |
iabudiab/htmlkit | An Objective-C framework for parsing and serializing HTML documents | 240 |
ezyang/htmlpurifier | An HTML filtering solution that ensures documents from untrusted sources are standards compliant and safe from XSS attacks. | 3,106 |