scraperjs

Web scraper

A versatile web scraping module with two scrapers for static and dynamic content extraction.

A complete and versatile web scraper.

4k stars

94 watching

188 forks

Language: JavaScript

Last commit: almost 6 years ago

Linked from 2 awesome lists

Related projects:

Repository	Description	Stars
ionicabizau/scrape-it	A Node.js library and CLI tool for automating web page scraping and parsing	4,024
rchipka/node-osmosis	A fast and flexible web scraping library using native libxml C bindings	4,115
apify/crawlee	A tool for building reliable web scraping and browser automation pipelines in Node.js	16,081
bda-research/node-crawler	A NodeJS-based web crawler and spider that extracts data from websites.	6,718
holgerd77/django-dynamic-scraper	An app that allows you to manage Scrapy spiders through a Django admin interface	1,155
spatie/crawler	A powerful web crawler written in PHP that can execute JavaScript and crawl multiple URLs concurrently	2,552
paperjs/paper.js	A JavaScript library and framework for creating vector graphics applications using HTML5 Canvas	14,555
veliovgroup/spiderable-middleware	Middleware that intercepts requests from web crawlers and proxies them to a prerendering service that returns rendered HTML	39
benibela/xidel	A tool to extract data from web pages using various query languages and selectors.	690
javve/list.js	A JavaScript library for adding search, sort, filters and flexibility to tables and lists in HTML elements.	11,207
tjatse/node-readability	Automates web page scraping and text extraction to make any webpage readable	343
spider-rs/spider	A tool for web data extraction and processing using Rust	1,234
jmcarp/robobrowser	A Python library for interacting with web pages without the need for a standalone browser	3,703
macbre/phantomas	A tool for collecting and monitoring web performance metrics in a headless Chromium browser environment	2,257
joseconstela/webparsy	A Node.js library and CLI for scraping websites using Puppeteer and YAML definitions	44