html5-parser

HTML parser

A fast HTML parser written in C, optimized for performance.

Fast C based HTML 5 parsing for python

GitHub

682 stars
17 watching
36 forks
Language: C
last commit: 4 months ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
html5lib/html5lib-python A standards-compliant Python library for parsing and serializing HTML documents and fragments. 1,138
servo/html5ever An HTML parser designed to meet the standards of modern web browsers 2,171
imangazaliev/didom A fast and simple HTML parser with support for CSS selectors and XPath expressions. 2,202
lexborisov/myhtml A fast HTML parsing library written in C 1,657
snjyor/htmlpageparser An HTML parsing library that converts web pages to structured data and then generates Markdown content from that data 1
chadwain/rem A parser for HTML5 documents written in Zig 108
kostya/myhtml A fast HTML5 parser with CSS selectors in Crystal language 154
bupt1987/html-parser A fast and efficient HTML parser for PHP. 525
iabudiab/htmlkit An Objective-C framework for parsing and serializing HTML documents 240
kennethreitz/requests-html A Pythonic HTML parsing library providing intuitive and asynchronous web scraping capabilities. 304
scrapy/scrapely A pure-python library for extracting structured data from HTML pages. 1,865
himujjal/zig-json5 A JSON5 parser/stringifier for Zig programming language. 23
serpapi/nokolexbor A high-performance HTML5 parser for Ruby based on Lexbor with support for CSS selectors and XPath. 327
p-ranav/csv2 A fast and efficient CSV parsing library for C++ 567
antchfx/htmlquery A Golang package for extracting data from HTML documents using XPath expressions. 744