node-osmosis
Web scraper
A fast and flexible web scraping library using native libxml C bindings
Web scraper for NodeJS
4k stars
74 watching
247 forks
Language: JavaScript
last commit: 12 months ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
ruipgil/scraperjs | A versatile web scraping module with two scrapers for static and dynamic content extraction. | 3,712 |
ionicabizau/scrape-it | A Node.js library and CLI tool for automating web page scraping and parsing | 4,021 |
bda-research/node-crawler | A NodeJS-based web crawler and spider that extracts data from websites. | 6,713 |
apify/crawlee | A tool for building reliable web scraping and browser automation pipelines in Node.js. | 15,845 |
jhy/jsoup | A Java library for parsing and manipulating HTML, XML, and CSS | 10,963 |
node-formidable/formidable | A module for parsing multipart form data, especially file uploads in Node.js applications. | 7,060 |
sparklemotion/nokogiri | A Ruby library for parsing and manipulating XML and HTML documents | 6,159 |
nodeca/embedza | Creates HTML snippets from URLs using data from oEmbed, Open Graph, and meta tags. | 64 |
fb55/htmlparser2 | A fast and forgiving HTML parser with a focus on minimal allocations | 4,465 |
axios/axios | An HTTP client library for making requests to web servers using the Promise API. | 105,901 |
clean-css/clean-css | A fast and efficient CSS optimizer for Node.js and modern browsers | 4,167 |
matthewmueller/x-ray | A flexible web scraping framework for extracting data from websites with customizable selectors and pagination support. | 5,881 |
naturalintelligence/fast-xml-parser | A fast and efficient JavaScript library for parsing and generating XML data | 2,596 |
joseconstela/webparsy | A Node.js library and CLI for scraping websites using Puppeteer and YAML definitions | 44 |
javve/list.js | A JavaScript library for adding search, sort, filters and flexibility to tables and lists in HTML elements. | 11,207 |