Beanbun

crawler framework

A PHP framework for building distributed web crawlers with modular design and extensibility

Beanbun 是用 PHP 编写的多进程网络爬虫框架，具有良好的开放性、高可扩展性，基于 Workerman。

GitHub

1k stars

77 watching

252 forks

Language: PHP

last commit: over 3 years ago

Linked from 1 awesome list

beanbuncrawlerphpspider

Backlinks from these awesome lists:

jingwentian/awesome-php

Related projects:

Repository	Description	Stars
codesofun/web-bee	A Java framework for building web-based crawlers with features like distributed crawling and proxy support.	189
zhegexiaohuozi/seimicrawler	A distributed crawler framework that simplifies the process of building crawlers using Spring Boot and Redis	1,980
hu17889/go_spider	A modular, concurrent web crawler framework written in Go.	1,827
xianhu/pspider	A Python web crawler framework with support for multi-threading and proxy usage.	1,828
qinxuye/cola	A high-level framework for building distributed data extractors from web pages	1,501
chenjiandongx/github-spider	A Python-based web crawler for scraping Github user and repository data.	264
feng19/spider_man	A high-level web crawling and scraping framework for Elixir.	23
crawlzone/crawlzone	A PHP framework for asynchronous internet crawling and web scraping	78
brendonboshell/supercrawler	A web crawler designed to crawl websites while obeying robots.txt rules, rate limits and concurrency limits, with customizable content handlers for parsing and processing crawled pages.	380
puerkitobio/fetchbot	A flexible web crawler that follows robots.txt policies and crawl delays.	787
antchfx/antch	A framework for building fast and efficient web crawlers and scrapers in Go.	261
wspl/creeper	A framework for building cross-platform web crawlers using Go	780
joncanning/skyscraper	A framework for building asynchronous web scrapers and crawlers using async/await and Reactive Extensions.	59
howie6879/ruia	An async web scraping micro-framework built with asyncio and aiohttp to simplify URL crawling	1,753
bixuehujin/blink	A high-performance web framework and application server built on top of PHP.	833