html-agility-pack

HTML parser

An HTML parsing library that allows developers to parse and manipulate malformed HTML documents

Html Agility Pack (HAP) is a free and open-source HTML parser written in C# to read/write DOM and supports plain XPATH or XSLT. It is a .NET code library that allows you to parse "out of the web" HTML files.

GitHub

3k stars
82 watching
376 forks
Language: C#
last commit: 10 days ago
Linked from 1 awesome list

haphtml-parserhtmlagilitypackparsexpath

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
anglesharp/anglesharp A C# library that parses and constructs HTML5, MathML, SVG, and CSS documents into a standard DOM representation for .NET developers. 5,168
zhegexiaohuozi/jsoupxpath An HTML parser implementing W3C XPATH 1.0 syntax for Java. 452
haml/haml A templating engine for HTML written in Ruby, designed to simplify and beautify HTML document generation. 3,766
webreflection/hyperhtml A lightweight virtual DOM alternative built on top of HTML template literals 3,070
terrier989/universal_html A cross-platform Dart package for parsing and manipulating HTML, XML, and CSS documents across various platforms. 0
jhy/jsoup A Java library for parsing and manipulating HTML, XML, and CSS 10,949
xoofx/markdig A fast, extensible Markdown processor for .NET with support for various Markdown flavors and features. 4,390
haxtheweb/hax11ty A toolset for building and deploying static websites with a minimal backend CMS 8
fb55/htmlparser2 A fast and forgiving HTML parser with a focus on minimal allocations 4,451
ericchiang/pup A command line tool for parsing and manipulating HTML 8,116
rehypejs/rehype-dom A library for parsing and compiling HTML with browser APIs 26
antchfx/htmlquery A Golang package for extracting data from HTML documents using XPath expressions. 738
lexborisov/myhtml A fast HTML parsing library written in C 1,655
extism/extism A framework for building extensible software and plugins by running arbitrary, untrusted code in a secure environment. 4,319
ezyang/htmlpurifier An HTML filtering solution that ensures documents from untrusted sources are standards compliant and safe from XSS attacks. 3,091