dataflowkit
Web scraper
A framework for extracting structured data from web pages using CSS selectors.
Extract structured data from web sites. Web sites scraping.
662 stars
24 watching
80 forks
Language: Go
last commit: over 1 year ago
Linked from 4 awesome lists
cdpchrome-fetchercrawlingextract-datagogolanggolang-libraryheadlessscraperscrapingscraping-websites
Related projects:
Repository | Description | Stars |
---|---|---|
miyagawa/web-scraper | A Perl toolkit for extracting structured data from HTML documents using a DSL-like interface. | 104 |
jakopako/goskyr | A tool to simplify web scraping of list-like structured data from web pages | 35 |
benibela/xidel | A tool to extract data from web pages using various query languages and selectors. | 681 |
davemolk/gogetjs | Tools for extracting and analyzing JavaScript files from web pages | 40 |
s0rg/crawley | A utility for systematically extracting URLs from web pages and printing them to the console. | 263 |
spider-rs/spider | A web crawler and scraper built on top of Rust, designed to extract data from the web in a flexible and configurable manner. | 1,140 |
yhat/scrape | A collection of utility functions and tools to simplify web scraping in Go. | 1,513 |
spekulatius/phpscraper | A web scraping utility for PHP that simplifies the process of extracting information from websites. | 536 |
elixir-crawly/crawly | A framework for extracting structured data from websites | 987 |
felipecsl/wombat | A Ruby-based web crawler and data extraction tool with an elegant DSL. | 1,315 |
the-markup/blacklight-collector | A tool for scraping website content and analyzing browser behavior | 202 |
stewartmckee/cobweb | A flexible web crawler that can be used to extract data from websites in a scalable and efficient manner | 226 |
propublica/upton | A web scraping framework that simplifies the process by handling repetitive tasks and provides options for efficient data retrieval | 1,613 |
joncanning/skyscraper | A framework for building asynchronous web scrapers and crawlers using async/await and Reactive Extensions. | 58 |
jaimeiniesta/metainspector | A Ruby gem for web scraping and extracting metadata from web pages. | 1,036 |