antch

Web crawler

A framework for building fast and efficient web crawlers and scrapers in Go.

Antch, a fast, powerful and extensible web crawling & scraping framework for Go

GitHub

261 stars
16 watching
41 forks
Language: Go
last commit: over 4 years ago
Linked from 2 awesome lists

crawlercrawlingframeworkgolangscrapingweb-crawlerweb-spider

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
hu17889/go_spider A modular, concurrent web crawler framework written in Go. 1,827
wspl/creeper A framework for building cross-platform web crawlers using Go 780
antchfx/htmlquery A Golang package for extracting data from HTML documents using XPath expressions. 744
puerkitobio/gocrawl A concurrent web crawler written in Go that allows flexible and polite crawling of websites. 2,036
feng19/spider_man A high-level web crawling and scraping framework for Elixir. 23
elixir-crawly/crawly A framework for extracting structured data from websites 994
iamstoxe/urlgrab A tool to crawl websites by exploring links recursively with support for JavaScript rendering. 331
fmpwizard/owlcrawler A distributed web crawler that coordinates crawling tasks across multiple worker processes using a message bus. 55
jmg/crawley A Pythonic framework for building high-speed web crawlers with flexible data extraction and storage options. 188
brendonboshell/supercrawler A web crawler designed to crawl websites while obeying robots.txt rules, rate limits and concurrency limits, with customizable content handlers for parsing and processing crawled pages. 380
fredwu/crawler A high-performance web crawling and scraping solution with customizable settings and worker pooling. 945
yhat/scrape A collection of utility functions and tools to simplify web scraping in Go. 1,513
antchfx/xpath Provides a Go package for querying and selecting nodes from various document types using XPath expressions. 696
zhegexiaohuozi/seimicrawler A distributed crawler framework that simplifies the process of building crawlers using Spring Boot and Redis 1,980
slotix/dataflowkit A framework for extracting structured data from web pages using CSS selectors. 667