hred

Data extractor

Extracts data from HTML or XML documents to JSON using a CSS selector-like query language

Reduce HTML and XML to JSON from the command line, using an expressive query language inspired by CSS selectors.

GitHub

69 stars
3 watching
1 forks
Language: JavaScript
last commit: about 2 months ago
clidata-extractionhtmljsonxml

Related projects:

Repository Description Stars
feichao93/temme A lightweight, CSS-based selector for extracting structured data from HTML documents. 273
plainas/tq Tool that extracts content from HTML documents based on CSS selectors 236
djhohnstein/sharpchromium Tool to extract data from Chromium-based browsers 692
yogthos/json-html Converts JSON data into human-readable HTML 162
davemolk/gogetjs Tools for extracting and analyzing JavaScript files from web pages 40
eyurtsev/kor Extracts structured data from unstructured text using large language models 1,629
utkarshkukreti/select.rs A Rust library for extracting useful data from HTML documents 974
bezoerb/grunt-critical Grunt plugin to extract and inline critical CSS from HTML files 154
mischov/meeseeks A parser and extractor for HTML and XML data with CSS or XPath selectors 316
djhohnstein/sharpweb A .NET project that extracts saved browser credentials from Google Chrome, Firefox, and Internet Explorer/Edge. 510
gandm/language-babel Tools and features for syntax highlighting and code analysis in JavaScript and related technologies 476
jjelosua/doga_scraper A tool that extracts and converts Galician Official journal documents to different formats based on input year. 0
003random/getjs A tool to extract JavaScript sources from URLs and web pages efficiently 712
karlicoss/kobuddy Extracts data from Kobo eReader databases for analysis and backup 149
theodi/csv2json Converts comma-separated values to JSON data format 2