pup

HTML parser

A command line tool for parsing and manipulating HTML

Parsing HTML at the command line

GitHub

8k stars
89 watching
256 forks
Language: HTML
last commit: 7 months ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
jhy/jsoup A Java library for parsing and manipulating HTML, XML, and CSS 10,949
scinfu/swiftsoup A Swift library for parsing and manipulating HTML documents. 4,553
terrier989/universal_html A cross-platform Dart package for parsing and manipulating HTML, XML, and CSS documents across various platforms. 0
cheeriojs/cheerio A fast and flexible HTML parser and DOM manipulator with jQuery-like API 28,692
gjtorikian/html-pipeline A Ruby-based framework for transforming user content into HTML markup with chainable filters 2,266
zzzprojects/html-agility-pack An HTML parsing library that allows developers to parse and manipulate malformed HTML documents 2,652
uncss/grunt-uncss A task for removing unused CSS from projects 3,825
jmcarp/robobrowser A Python library for interacting with web pages without the need for a standalone browser 3,702
fox-it/dissect.ole A Python module that parses the Object Linking & Embedding (OLE) format used by document editors on Windows operating systems. 3
mholt/papaparse A fast and reliable CSV parser for JavaScript applications 12,552
bevacqua/woofmark Rich text editing library for HTML and Markdown 1,630
lexborisov/myhtml A fast HTML parsing library written in C 1,655
symfony/panther A library that automates browser testing and web scraping using real browsers and the WebDriver protocol. 2,944
iabudiab/htmlkit An Objective-C framework for parsing and serializing HTML documents 240
ezyang/htmlpurifier An HTML filtering solution that ensures documents from untrusted sources are standards compliant and safe from XSS attacks. 3,091