pup
HTML parser
A command line tool for parsing and manipulating HTML
Parsing HTML at the command line
8k stars
89 watching
256 forks
Language: HTML
last commit: 7 months ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
jhy/jsoup | A Java library for parsing and manipulating HTML, XML, and CSS | 10,949 |
scinfu/swiftsoup | A Swift library for parsing and manipulating HTML documents. | 4,553 |
terrier989/universal_html | A cross-platform Dart package for parsing and manipulating HTML, XML, and CSS documents across various platforms. | 0 |
cheeriojs/cheerio | A fast and flexible HTML parser and DOM manipulator with jQuery-like API | 28,692 |
gjtorikian/html-pipeline | A Ruby-based framework for transforming user content into HTML markup with chainable filters | 2,266 |
zzzprojects/html-agility-pack | An HTML parsing library that allows developers to parse and manipulate malformed HTML documents | 2,652 |
uncss/grunt-uncss | A task for removing unused CSS from projects | 3,825 |
jmcarp/robobrowser | A Python library for interacting with web pages without the need for a standalone browser | 3,702 |
fox-it/dissect.ole | A Python module that parses the Object Linking & Embedding (OLE) format used by document editors on Windows operating systems. | 3 |
mholt/papaparse | A fast and reliable CSV parser for JavaScript applications | 12,552 |
bevacqua/woofmark | Rich text editing library for HTML and Markdown | 1,630 |
lexborisov/myhtml | A fast HTML parsing library written in C | 1,655 |
symfony/panther | A library that automates browser testing and web scraping using real browsers and the WebDriver protocol. | 2,944 |
iabudiab/htmlkit | An Objective-C framework for parsing and serializing HTML documents | 240 |
ezyang/htmlpurifier | An HTML filtering solution that ensures documents from untrusted sources are standards compliant and safe from XSS attacks. | 3,091 |