scrape-it

scraper

A Node.js library and CLI tool for automating web page scraping and parsing

🔮 A Node.js scraper for humans.

GitHub

4k stars
64 watching
220 forks
Language: JavaScript
Last commit: 7 days ago
Linked from 1 awesome list

hacktoberfest, node-scraper, scraper

Related projects:

Repository | Description | Stars
ruipgil/scraperjs | A versatile web scraping module with two scrapers for static and dynamic content extraction. | 3,710
rchipka/node-osmosis | A fast and flexible web scraping library using native libxml C bindings | 4,116
dessant/buster | A browser extension that helps users solve CAPTCHA challenges by simulating human interactions. | 8,023
anorov/cloudflare-scrape | A tool to bypass Cloudflare's anti-bot page and access protected websites | 3,391
matthewmueller/x-ray | A flexible web scraping framework for extracting data from websites with customizable selectors and pagination support. | 5,878
gocolly/colly | A framework for extracting structured data from websites in a fast and elegant way | 23,317
fabianwennink/iconcaptcha-php | A captcha solution designed to be fast and user-friendly, providing an easy alternative to traditional captchas. | 140
unclecode/crawl4ai | A tool for web crawling and data extraction, designed to work with large language models. | 16,180
finic-ai/finic | Provides cloud-hosted browsers for automation and scraping tasks to avoid detection by websites. | 2,296
justanotherarchivist/snscrape | A Python-based social media scraper that extracts data from various platforms. | 4,490
jhy/jsoup | A Java library for parsing and manipulating HTML, XML, and CSS | 10,949
prosopo/captcha | Protects websites from bots and automated abuse by solving a challenge without collecting user data | 48
workshopper/javascripting | An interactive terminal-based tutorial to learn JavaScript | 2,865
hoppscotch/hoppscotch | An API development and testing ecosystem with a minimalistic UI, supporting various protocols and authentication methods. | 65,598
apify/crawlee | A tool for building reliable web scraping and browser automation pipelines in Node.js. | 15,604