suckit
Web scraper
A Rust-based web scraping tool that recursively visits and downloads websites to disk.
Suck the InTernet
747 stars
10 watching
39 forks
Language: Rust
Last commit: 8 months ago
Linked from 1 awesome list
hacktoberfest, rust, webscraping
Related projects:
Repository | Description | Stars |
---|---|---|
spider-rs/spider | A web crawler and scraper written in Rust, designed to extract data from the web in a flexible and configurable manner. | 1,140 |
the-markup/blacklight-collector | A tool for scraping website content and analyzing browser behavior. | 202 |
rust-scraper/scraper | A Rust library for parsing and querying HTML documents using CSS selectors. | 1,937 |
archiveteam/grab-site | A web crawler designed to backup websites by recursively crawling and writing WARC files. | 1,398 |
fimad/scalpel | A web scraping library providing a declarative interface on top of an HTML parsing library to extract data from HTML pages. | 323 |
utkarshkukreti/select.rs | A Rust library for extracting useful data from HTML documents | 974 |
yhat/scrape | A collection of utility functions and tools to simplify web scraping in Go. | 1,513 |
spekulatius/phpscraper | A web scraping utility for PHP that simplifies the process of extracting information from websites. | 536 |
internetarchive/brozzler | A distributed web crawler that fetches and extracts links from websites using a real browser. | 671 |
vida-nyu/ache | A web crawler designed to efficiently collect and prioritize relevant content from the web. | 454 |
jakopako/goskyr | A tool to simplify web scraping of list-like structured data from web pages. | 35 |
archiveteam/wpull | Downloads and crawls web pages, allowing for the archiving of websites. | 556 |
tidyverse/rvest | A package for extracting data from web pages using HTML parsing and CSS/XPath selectors. | 1,492 |
felipecsl/wombat | A Ruby-based web crawler and data extraction tool with an elegant DSL. | 1,315 |
medialab/minet | A command line tool and Python library for extracting data from various web sources. | 286 |