blacklight-collector
Website scraper
A tool for scraping website content and analyzing browser behavior
205 stars
14 watching
36 forks
Language: TypeScript
Last commit: 2 months ago
Related projects:
Repository | Description | Stars |
---|---|---|
benibela/xidel | A tool to extract data from web pages using various query languages and selectors. | 690 |
spekulatius/phpscraper | A web scraping utility for PHP that simplifies the process of extracting information from websites. | 544 |
skallwar/suckit | A Rust-based web scraping tool that recursively visits and downloads websites to disk. | 750 |
rust-scraper/scraper | A Rust library for parsing and querying HTML documents using CSS selectors. | 1,961 |
miyagawa/web-scraper | A Perl toolkit for extracting structured data from HTML documents using a DSL-like interface. | 104 |
slotix/dataflowkit | A framework for extracting structured data from web pages using CSS selectors. | 667 |
propublica/upton | A web scraping framework that handles repetitive tasks and provides options for efficient data retrieval. | 1,612 |
martinsbalodis/web-scraper-chrome-extension | A web scraping tool integrated into a Chrome browser extension. | 1,318 |
spider-rs/spider | A tool for web data extraction and processing using Rust. | 1,234 |
felipecsl/wombat | A Ruby-based web crawler and data extraction tool with an elegant DSL. | 1,315 |
scrapy/scrapely | A pure-Python library for extracting structured data from HTML pages. | 1,865 |
fimad/scalpel | A web scraping library providing a declarative interface on top of an HTML parsing library to extract data from HTML pages. | 325 |
tjatse/node-readability | Automates web page scraping and text extraction to make any webpage readable. | 343 |
medialab/minet | A command line tool and Python library for extracting data from various web sources. | 293 |
archiveteam/wpull | Downloads and crawls web pages, allowing for the archiving of websites. | 556 |