scraperjs

Web scraper

A versatile web scraping module with two scrapers for static and dynamic content extraction.

A complete and versatile web scraper.

GitHub

4k stars
94 watching
188 forks
Language: JavaScript
last commit: about 4 years ago
Linked from 2 awesome lists


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
ionicabizau/scrape-it A Node.js library and CLI tool for automating web page scraping and parsing 4,012
rchipka/node-osmosis A fast and flexible web scraping library using native libxml C bindings 4,116
apify/crawlee A tool for building reliable web scraping and browser automation pipelines in Node.js. 15,604
bda-research/node-crawler A NodeJS-based web crawler and spider that extracts data from websites. 6,704
holgerd77/django-dynamic-scraper An app that allows you to manage Scrapy spiders through a Django admin interface. 1,153
spatie/crawler A powerful web crawler written in PHP that can execute JavaScript and crawl multiple URLs concurrently. 2,537
paperjs/paper.js A JavaScript library and framework for creating vector graphics applications using HTML5 Canvas 14,507
veliovgroup/spiderable-middleware intercepts requests from web crawlers and proxies them to a prerendering service for rendering HTML 38
benibela/xidel A tool to extract data from web pages using various query languages and selectors. 681
javve/list.js A JavaScript library for adding search, sort, filters and flexibility to tables and lists in HTML elements. 11,204
tjatse/node-readability Automates web page scraping and text extraction to make any webpage readable 343
spider-rs/spider A web crawler and scraper built on top of Rust, designed to extract data from the web in a flexible and configurable manner. 1,140
jmcarp/robobrowser A Python library for interacting with web pages without the need for a standalone browser 3,702
macbre/phantomas A tool for collecting and monitoring web performance metrics in a headless Chromium browser environment. 2,258
joseconstela/webparsy A Node.js library and CLI for scraping websites using Puppeteer and YAML definitions 44