nokolexbor
HTML parser library
A high-performance HTML5 parser for Ruby based on Lexbor with support for CSS selectors and XPath.
244 stars
13 watching
4 forks
Language: C
last commit: 3 months ago
Linked from 1 awesome list
c-extension, css, html5, parser, ruby, serpapi, web-scraping, xpath
Related projects:
Repository | Description | Stars |
---|---|---|
servo/html5ever | A high-performance HTML parser written in Rust. | 2,148 |
lexborisov/myhtml | A fast HTML parsing library written in C | 1,655 |
mischov/meeseeks | A parser and extractor for HTML and XML data with CSS or XPath selectors | 316 |
imangazaliev/didom | A fast and simple HTML parser with support for CSS selectors and XPath expressions. | 2,200 |
rubycocos/feedparser | A universal web feed parser and normalizer for various formats including Atom, RSS, JSON Feed, HTML, and more. | 164 |
felipecsl/wombat | A Ruby-based web crawler and data extraction tool with an elegant DSL. | 1,315 |
utkarshkukreti/select.rs | A Rust library for extracting useful data from HTML documents | 974 |
lexbor/lexbor | A fast, portable, and modular HTML rendering engine written in C. | 1,615 |
kovidgoyal/html5-parser | A fast HTML parser written in C, optimized for performance. | 682 |
marcelog/ex_rfc3966 | A Tel URI parser based on the official uri grammar following the RFC3966 standard. | 1 |
hashrock/deno-fnparse | A parser combinator for JavaScript to parse CSV files based on RFC4180 standards. | 11 |
seamusabshere/remote_table | A Ruby library that reads and parses various file formats, including XLSX, XLS, ODS, CSV, TSV, and Google Docs, returning tabular data as arrays or hashes. | 226 |
antchfx/htmlquery | A Golang package for extracting data from HTML documents using XPath expressions. | 740 |
henrypoydar/chronic_duration | An elapsed time parser for natural language inputs in Ruby | 351 |
techiferous/rack-plastic | A gem that simplifies the development of Rack middleware by providing a convenient API for parsing and modifying HTML responses. | 22 |