scrape-it

scraper

A Node.js library and CLI tool for automating web page scraping and parsing

🔮 A Node.js scraper for humans.

GitHub

4k stars
62 watching
220 forks
Language: JavaScript
last commit: 2 months ago
Linked from 1 awesome list

hacktoberfestnode-scraperscraper

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
ruipgil/scraperjs A versatile web scraping module with two scrapers for static and dynamic content extraction. 3,714
rchipka/node-osmosis A fast and flexible web scraping library using native libxml C bindings 4,115
dessant/buster A browser extension that helps users solve CAPTCHA challenges by simulating human interactions. 8,097
anorov/cloudflare-scrape A tool to bypass Cloudflare's anti-bot page and access protected websites 3,406
matthewmueller/x-ray A flexible web scraping framework for extracting data from websites with customizable selectors and pagination support. 5,883
gocolly/colly A framework for extracting structured data from websites in a fast and elegant way 23,444
fabianwennink/iconcaptcha-php A captcha solution designed to be fast and user-friendly, providing an easy alternative to traditional captchas. 144
unclecode/crawl4ai A web crawling tool designed to extract structured data from the web for use in AI applications 18,541
finic-ai/finic Provides cloud-hosted browsers for automation and scraping tasks to avoid detection by websites. 2,311
justanotherarchivist/snscrape A Python-based social media scraper that extracts data from various platforms. 4,557
jhy/jsoup A Java library for parsing and manipulating HTML, XML, and CSS 10,985
prosopo/captcha Protects websites from bots and automated abuse by solving a challenge without collecting user data 50
workshopper/javascripting An interactive terminal-based tutorial to learn JavaScript 2,871
hoppscotch/hoppscotch An API development and testing ecosystem with a minimalistic UI, supporting various protocols and authentication methods. 66,110
apify/crawlee A tool for building reliable web scraping and browser automation pipelines in Node.js. 16,081