spider_man
Crawler library
A high-level web crawling and scraping framework for Elixir.
SpiderMan is a fast, high-level web crawling and scraping framework for Elixir, built on Broadway.
23 stars
4 watching
4 forks
Language: Elixir
last commit: 12 months ago
Linked from 1 awesome list
Topics: crawler, data-mining, elixir, erlang, framework, spider
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | A framework for extracting structured data from websites. | 994 |
| | A high-performance web crawling and scraping solution with customizable settings and worker pooling. | 945 |
| | A modular, concurrent web crawler framework written in Go. | 1,827 |
| | A Ruby web crawling library that provides flexible and customizable methods to crawl websites. | 809 |
| | An Erlang-based web crawler designed to be scalable and highly configurable. | 330 |
| | A Python-based web crawler for scraping GitHub user and repository data. | 264 |
| | A Python web crawling framework that uses asyncio and aiohttp for efficient data extraction from websites. | 2,037 |
| | A tool for web data extraction and processing using Rust. | 1,234 |
| | An async web scraping micro-framework built with asyncio and aiohttp to simplify URL crawling. | 1,753 |
| | A framework for building fast and efficient web crawlers and scrapers in Go. | 261 |
| | A web crawling library for .NET that allows customizable crawling and throttling of websites. | 248 |
| | A high-level framework for building distributed data extractors from web pages. | 1,501 |
| | A web crawling framework written in Scala that lets users define the start URL and parse the response from it. | 113 |
| | A web scraping and automation tool for Elixir. | 30 |
| | A Python web crawler framework with support for multi-threading and proxy usage. | 1,828 |