node-osmosis

Web scraper

A fast and flexible web scraping library using native libxml C bindings

Web scraper for NodeJS

GitHub

4k stars
74 watching
247 forks
Language: JavaScript
last commit: 12 months ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
ruipgil/scraperjs A versatile web scraping module with two scrapers for static and dynamic content extraction. 3,712
ionicabizau/scrape-it A Node.js library and CLI tool for automating web page scraping and parsing 4,021
bda-research/node-crawler A NodeJS-based web crawler and spider that extracts data from websites. 6,713
apify/crawlee A tool for building reliable web scraping and browser automation pipelines in Node.js. 15,845
jhy/jsoup A Java library for parsing and manipulating HTML, XML, and CSS 10,963
node-formidable/formidable A module for parsing multipart form data, especially file uploads in Node.js applications. 7,060
sparklemotion/nokogiri A Ruby library for parsing and manipulating XML and HTML documents 6,159
nodeca/embedza Creates HTML snippets from URLs using data from oEmbed, Open Graph, and meta tags. 64
fb55/htmlparser2 A fast and forgiving HTML parser with a focus on minimal allocations 4,465
axios/axios An HTTP client library for making requests to web servers using the Promise API. 105,901
clean-css/clean-css A fast and efficient CSS optimizer for Node.js and modern browsers 4,167
matthewmueller/x-ray A flexible web scraping framework for extracting data from websites with customizable selectors and pagination support. 5,881
naturalintelligence/fast-xml-parser A fast and efficient JavaScript library for parsing and generating XML data 2,596
joseconstela/webparsy A Node.js library and CLI for scraping websites using Puppeteer and YAML definitions 44
javve/list.js A JavaScript library for adding search, sort, filters and flexibility to tables and lists in HTML elements. 11,207