scraperjs
Web scraper
A versatile web scraping module with two scrapers for static and dynamic content extraction.
A complete and versatile web scraper.
4k stars
94 watching
188 forks
Language: JavaScript
last commit: about 4 years ago
Linked from 2 awesome lists
Related projects:
Repository | Description | Stars |
---|---|---|
ionicabizau/scrape-it | A Node.js library and CLI tool for automating web page scraping and parsing | 4,012 |
rchipka/node-osmosis | A fast and flexible web scraping library using native libxml C bindings | 4,116 |
apify/crawlee | A tool for building reliable web scraping and browser automation pipelines in Node.js. | 15,604 |
bda-research/node-crawler | A NodeJS-based web crawler and spider that extracts data from websites. | 6,704 |
holgerd77/django-dynamic-scraper | An app that allows you to manage Scrapy spiders through a Django admin interface. | 1,153 |
spatie/crawler | A powerful web crawler written in PHP that can execute JavaScript and crawl multiple URLs concurrently. | 2,537 |
paperjs/paper.js | A JavaScript library and framework for creating vector graphics applications using HTML5 Canvas | 14,507 |
veliovgroup/spiderable-middleware | intercepts requests from web crawlers and proxies them to a prerendering service for rendering HTML | 38 |
benibela/xidel | A tool to extract data from web pages using various query languages and selectors. | 681 |
javve/list.js | A JavaScript library for adding search, sort, filters and flexibility to tables and lists in HTML elements. | 11,204 |
tjatse/node-readability | Automates web page scraping and text extraction to make any webpage readable | 343 |
spider-rs/spider | A web crawler and scraper built on top of Rust, designed to extract data from the web in a flexible and configurable manner. | 1,140 |
jmcarp/robobrowser | A Python library for interacting with web pages without the need for a standalone browser | 3,702 |
macbre/phantomas | A tool for collecting and monitoring web performance metrics in a headless Chromium browser environment. | 2,258 |
joseconstela/webparsy | A Node.js library and CLI for scraping websites using Puppeteer and YAML definitions | 44 |