crawler

crawler

Performs web page crawling at high performance.

gRPC web crawler turbo charged for performance

51 stars

5 watching

2 forks

Language: Rust

last commit: almost 2 years ago

Linked from 1 awesome list

a11ywatchcrawlergrpcscraper

Screenshot of a11ywatch/crawler website

docs.rs/crate/website_crawler/latest

Backlinks from these awesome lists:

brucedone/awesome-crawler

Related projects:

Repository	Description	Stars
archiveteam/grab-site	A web crawler designed to backup websites by recursively crawling and writing WARC files.	1,406
webrecorder/browsertrix-crawler	A containerized browser-based crawler system for capturing web content in a high-fidelity and customizable manner.	677
elliotgao2/gain	A Python web crawling framework utilizing asyncio and aiohttp for efficient data extraction from websites.	2,037
archiveteam/wpull	Downloads and crawls web pages, allowing for the archiving of websites.	556
fredwu/crawler	A high-performance web crawling and scraping solution with customizable settings and worker pooling.	945
puerkitobio/fetchbot	A flexible web crawler that follows robots.txt policies and crawl delays.	787
brendonboshell/supercrawler	A web crawler designed to crawl websites while obeying robots.txt rules, rate limits and concurrency limits, with customizable content handlers for parsing and processing crawled pages.	380
internetarchive/brozzler	A distributed web crawler that fetches and extracts links from websites using a real browser.	678
crypto-crawler/crypto-crawler-rs	A Rust-based library for building and managing cryptocurrency crawlers	235
joenorton/rubyretriever	A Ruby-based tool for web crawling and data extraction, aiming to be a replacement for paid software in the SEO space.	143
vida-nyu/ache	A web crawler designed to efficiently collect and prioritize relevant content from the web	459
amoilanen/js-crawler	A Node.js module for crawling web sites and scraping their content	254
cocrawler/cocrawler	A versatile web crawler built with modern tools and concurrency to handle various crawl tasks	188
spider-rs/spider	A tool for web data extraction and processing using Rust	1,234
howie6879/ruia	An async web scraping micro-framework built with asyncio and aiohttp to simplify URL crawling	1,753