Beanbun

crawler framework

A PHP framework for building distributed web crawlers with modular design and extensibility

Beanbun 是用 PHP 编写的多进程网络爬虫框架,具有良好的开放性、高可扩展性,基于 Workerman。

GitHub

1k stars
77 watching
252 forks
Language: PHP
last commit: over 1 year ago
Linked from 1 awesome list

beanbuncrawlerphpspider

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
codesofun/web-bee A Java framework for building web-based crawlers with features like distributed crawling and proxy support. 189
zhegexiaohuozi/seimicrawler A distributed crawler framework that simplifies the process of building crawlers using Spring Boot and Redis 1,980
hu17889/go_spider A modular, concurrent web crawler framework written in Go. 1,827
xianhu/pspider A Python web crawler framework with support for multi-threading and proxy usage. 1,828
qinxuye/cola A high-level framework for building distributed data extractors from web pages 1,501
chenjiandongx/github-spider A Python-based web crawler for scraping Github user and repository data. 264
feng19/spider_man A high-level web crawling and scraping framework for Elixir. 23
crawlzone/crawlzone A PHP framework for asynchronous internet crawling and web scraping 78
brendonboshell/supercrawler A web crawler designed to crawl websites while obeying robots.txt rules, rate limits and concurrency limits, with customizable content handlers for parsing and processing crawled pages. 380
puerkitobio/fetchbot A flexible web crawler that follows robots.txt policies and crawl delays. 787
antchfx/antch A framework for building fast and efficient web crawlers and scrapers in Go. 261
wspl/creeper A framework for building cross-platform web crawlers using Go 780
joncanning/skyscraper A framework for building asynchronous web scrapers and crawlers using async/await and Reactive Extensions. 59
howie6879/ruia An async web scraping micro-framework built with asyncio and aiohttp to simplify URL crawling 1,753
bixuehujin/blink A high-performance web framework and application server built on top of PHP. 833