MSpider

Web Crawler

A Python-based tool for web crawling and data collection from various websites

Spider

348 stars

55 watching

191 forks

Language: Python

Last commit: about 4 years ago

Linked from 1 awesome list

mspider

Backlinks from these awesome lists:

brucedone/awesome-crawler

Related projects:

Repository	Description	Stars
rivermont/spidy	A simple command-line web crawler that automatically extracts links from web pages and can be run in parallel for efficient crawling.	340
jmg/crawley	A Pythonic framework for building high-speed web crawlers with flexible data extraction and storage options.	188
xianhu/pspider	A Python web crawler framework with support for multi-threading and proxy usage.	1,828
hightman/pspider	A parallel web crawler framework built using PHP and MySQLi	266
stewartmckee/cobweb	A flexible web crawler for scalable, efficient data extraction from websites.	226
medialab/minet	A command line tool and Python library for extracting data from various web sources.	293
mvdbos/php-spider	A flexible PHP web crawler with configurable traversal algorithms and filters.	1,336
chenjiandongx/github-spider	A Python-based web crawler for scraping GitHub user and repository data.	264
feng19/spider_man	A high-level web crawling and scraping framework for Elixir.	23
elliotgao2/gain	A Python web crawling framework utilizing asyncio and aiohttp for efficient data extraction from websites.	2,037
postmodern/spidr	A Ruby web crawling library that provides flexible and customizable methods to crawl websites.	809
spider-rs/spider	A tool for web data extraction and processing using Rust.	1,234
fmpwizard/owlcrawler	A distributed web crawler that coordinates crawling tasks across multiple worker processes using a message bus.	55
3nock/spidersuite	A cross-platform web spider/crawler tool for analyzing and mapping attack surfaces.	614
holgerd77/django-dynamic-scraper	An app that allows you to manage Scrapy spiders through a Django admin interface.	1,155