pup

HTML parser

A command line tool for parsing and manipulating HTML

Parsing HTML at the command line

GitHub

8k stars
90 watching
259 forks
Language: HTML
last commit: 9 months ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
jhy/jsoup A Java library for parsing and manipulating HTML, XML, and CSS 10,985
scinfu/swiftsoup A Swift library for parsing and manipulating HTML documents. 4,586
terrier989/universal_html A cross-platform Dart package for parsing and manipulating HTML, XML, and CSS documents across various platforms. 0
cheeriojs/cheerio A fast and flexible HTML parser and DOM manipulator with jQuery-like API 28,793
gjtorikian/html-pipeline A Ruby-based framework for transforming user content into HTML markup with chainable filters 2,267
zzzprojects/html-agility-pack An HTML parsing library that allows developers to parse and manipulate malformed HTML documents 2,665
uncss/grunt-uncss A task for removing unused CSS from projects 3,824
jmcarp/robobrowser A Python library for interacting with web pages without the need for a standalone browser 3,703
fox-it/dissect.ole A parser for the Object Linking & Embedding (OLE) format used by Windows document editors. 3
mholt/papaparse A fast and reliable CSV parser for JavaScript applications 12,621
bevacqua/woofmark Rich text editing library for HTML and Markdown 1,628
lexborisov/myhtml A fast HTML parsing library written in C 1,657
symfony/panther A library that automates browser testing and web scraping using real browsers and the WebDriver protocol. 2,955
iabudiab/htmlkit An Objective-C framework for parsing and serializing HTML documents 240
ezyang/htmlpurifier An HTML filtering solution that ensures documents from untrusted sources are standards compliant and safe from XSS attacks. 3,106