blacklight-collector

Website scraper

A tool for scraping website content and analyzing browser behavior

205 stars

14 watching

36 forks

Language: TypeScript

last commit: over 1 year ago

Related projects:

Repository	Description	Stars
benibela/xidel	A tool to extract data from web pages using various query languages and selectors.	690
spekulatius/phpscraper	A web scraping utility for PHP that simplifies the process of extracting information from websites.	544
skallwar/suckit	A Rust-based web scraping tool that recursively visits and downloads websites to disk.	750
rust-scraper/scraper	A Rust library for parsing and querying HTML documents using CSS selectors.	1,961
miyagawa/web-scraper	A Perl toolkit for extracting structured data from HTML documents using a DSL-like interface.	104
slotix/dataflowkit	A framework for extracting structured data from web pages using CSS selectors.	667
propublica/upton	A web scraping framework that simplifies the process by handling repetitive tasks and provides options for efficient data retrieval	1,612
martinsbalodis/web-scraper-chrome-extension	A web scraping tool integrated into a Chrome browser extension	1,318
spider-rs/spider	A tool for web data extraction and processing using Rust	1,234
felipecsl/wombat	A Ruby-based web crawler and data extraction tool with an elegant DSL.	1,315
scrapy/scrapely	A pure-python library for extracting structured data from HTML pages.	1,865
fimad/scalpel	A web scraping library providing a declarative interface on top of an HTML parsing library to extract data from HTML pages	325
tjatse/node-readability	Automates web page scraping and text extraction to make any webpage readable	343
medialab/minet	A command line tool and Python library for extracting data from various web sources.	293
archiveteam/wpull	Downloads and crawls web pages, allowing for the archiving of websites.	556