colly

Website scraper

A framework for extracting structured data from websites in a fast and elegant way

Elegant Scraper and Crawler Framework for Golang

GitHub

23k stars
332 watching
2k forks
Language: Go
last commit: 4 months ago
Linked from 3 awesome lists

crawlercrawlingframeworkgogolangscraperscrapingspider

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
geziyor/geziyor A fast and flexible web crawling and scraping framework for extracting structured data from websites. 2,629
jaeles-project/gospider A tool for web crawling and exploitation written in Go. 2,578
unclecode/crawl4ai A tool for web crawling and data extraction, designed to work with large language models. 16,180
gohugoio/hugo A fast and flexible tool for generating static websites with built-in support for various content formats. 75,938
yujiosaka/headless-chrome-crawler A distributed crawling framework that leverages Headless Chrome to scrape dynamic websites 5,527
apify/crawlee A tool for building reliable web scraping and browser automation pipelines in Node.js. 15,604
hu17889/go_spider A modular, concurrent web crawler framework written in Go. 1,826
hakluke/hakrawler A tool for automatically discovering and crawling web application endpoints and assets 4,502
ionicabizau/scrape-it A Node.js library and CLI tool for automating web page scraping and parsing 4,012
s0md3v/photon A fast and flexible web crawler designed to gather information from the internet 11,067
needmorecowbell/giggity A tool to scrape and store hierarchical data about GitHub organizations, users, or repositories. 126
golangci/golangci-lint A fast and efficient tool for running Go code checks in parallel. 15,693
puerkitobio/gocrawl A concurrent web crawler written in Go that allows flexible and polite crawling of websites. 2,038
spatie/crawler A powerful web crawler written in PHP that can execute JavaScript and crawl multiple URLs concurrently. 2,537
my8100/scrapydweb A web application for managing Scrapyd clusters, analyzing Scrapy logs, and visualizing results. 3,161