nokolexbor

HTML parser library

A high-performance HTML5 parser for Ruby based on Lexbor with support for CSS selectors and XPath.

High-performance HTML5 parser for Ruby based on Lexbor, with support for both CSS selectors and XPath.

GitHub

244 stars
13 watching
4 forks
Language: C
last commit: 3 months ago
Linked from 1 awesome list

c-extensioncsshtml5parserrubyserpapiweb-scrapingxpath

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
servo/html5ever A high-performance HTML parser written in Rust. 2,148
lexborisov/myhtml A fast HTML parsing library written in C 1,655
mischov/meeseeks A parser and extractor for HTML and XML data with CSS or XPath selectors 316
imangazaliev/didom A fast and simple HTML parser with support for CSS selectors and XPath expressions. 2,200
rubycocos/feedparser A universal web feed parser and normalizer for various formats including Atom, RSS, JSON Feed, HTML, and more. 164
felipecsl/wombat A Ruby-based web crawler and data extraction tool with an elegant DSL. 1,315
utkarshkukreti/select.rs A Rust library for extracting useful data from HTML documents 974
lexbor/lexbor A fast, portable, and modular HTML rendering engine written in C. 1,615
kovidgoyal/html5-parser A fast HTML parser written in C, optimized for performance. 682
marcelog/ex_rfc3966 A Tel URI parser based on the official uri grammar following the RFC3966 standard. 1
hashrock/deno-fnparse A parser combinator for JavaScript to parse CSV files based on RFC4180 standards. 11
seamusabshere/remote_table A Ruby library that reads and parses various file formats, including XLSX, XLS, ODS, CSV, TSV, and Google Docs, returning tabular data as arrays or hashes. 226
antchfx/htmlquery A Golang package for extracting data from HTML documents using XPath expressions. 740
henrypoydar/chronic_duration An elasped time parser for natural language inputs in Ruby 351
techiferous/rack-plastic A gem that simplifies the development of Rack middleware by providing a convenient API for parsing and modifying HTML responses. 22