html5lib-python
HTML parser
A standards-compliant Python library for parsing and serializing HTML documents and fragments.
1k stars
50 watching
285 forks
Language: Python
Last commit: about 1 year ago
Linked from 2 awesome lists
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | A fast HTML parser written in C, optimized for performance. | 682 |
| | A pure-python library for extracting structured data from HTML pages. | 1,865 |
| | A fast and efficient HTML parser for PHP. | 525 |
| | An HTML parser designed to meet the standards of modern web browsers | 2,171 |
| | A fast HTML parsing library written in C | 1,657 |
| | An HTML parsing library that converts web pages to structured data and then generates Markdown content from that data | 1 |
| | An HTML5 parser for Common Lisp. | 55 |
| | A Pythonic HTML parsing library providing intuitive and asynchronous web scraping capabilities. | 304 |
| | A Python library for parsing and analyzing output files from computational chemistry packages | 339 |
| | A fast and simple HTML parser with support for CSS selectors and XPath expressions. | 2,202 |
| | A JavaScript library that parses QML and JavaScript files at runtime | 28 |
| | An Objective-C framework for parsing and serializing HTML documents | 240 |
| | A library that parses strings using a specification based on the Python format() syntax | 1,732 |
| | A Haskell library for parsing and extracting information from HTML/XML documents | 233 |
| | A PHP library for manipulating and parsing URIs according to various standards | 1,045 |