html-agility-pack

HTML parser

An HTML parsing library that allows developers to parse and manipulate malformed HTML documents

Html Agility Pack (HAP) is a free and open-source HTML parser written in C# to read/write DOM and supports plain XPATH or XSLT. It is a .NET code library that allows you to parse "out of the web" HTML files.

GitHub

3k stars
82 watching
381 forks
Language: C#
last commit: 3 months ago
Linked from 1 awesome list

haphtml-parserhtmlagilitypackparsexpath

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
anglesharp/anglesharp A C# library that parses and constructs HTML5, MathML, SVG, and CSS documents into a standard DOM representation for .NET developers. 5,197
zhegexiaohuozi/jsoupxpath A Java-based HTML parser implementing the W3C XPATH 1.0 standard syntax for XPath expressions. 453
haml/haml A templating engine for HTML that uses a concise syntax and automatic indentation to simplify the process of writing and rendering HTML documents 3,766
webreflection/hyperhtml A lightweight virtual DOM alternative built on top of HTML template literals 3,071
terrier989/universal_html A cross-platform Dart package for parsing and manipulating HTML, XML, and CSS documents across various platforms. 0
jhy/jsoup A Java library for parsing and manipulating HTML, XML, and CSS 10,985
xoofx/markdig A fast, extensible Markdown processor for .NET with support for various Markdown flavors and features. 4,468
haxtheweb/hax11ty A toolset for building and deploying static websites with a minimal backend CMS 8
fb55/htmlparser2 A fast and forgiving HTML parser with a focus on minimal allocations 4,474
ericchiang/pup A command line tool for parsing and manipulating HTML 8,185
rehypejs/rehype-dom A library for parsing and compiling HTML with browser APIs 26
antchfx/htmlquery A Golang package for extracting data from HTML documents using XPath expressions. 744
lexborisov/myhtml A fast HTML parsing library written in C 1,657
extism/extism A framework that enables building extensible software using WebAssembly, supporting multiple languages and runtimes. 4,562
ezyang/htmlpurifier An HTML filtering solution that ensures documents from untrusted sources are standards compliant and safe from XSS attacks. 3,106