dataflowkit

Web scraper

A framework for extracting structured data from web pages using CSS selectors.

Extract structured data from web sites. Web sites scraping.

GitHub

662 stars
24 watching
80 forks
Language: Go
last commit: over 1 year ago
Linked from 4 awesome lists

cdpchrome-fetchercrawlingextract-datagogolanggolang-libraryheadlessscraperscrapingscraping-websites

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
miyagawa/web-scraper A Perl toolkit for extracting structured data from HTML documents using a DSL-like interface. 104
jakopako/goskyr A tool to simplify web scraping of list-like structured data from web pages 35
benibela/xidel A tool to extract data from web pages using various query languages and selectors. 681
davemolk/gogetjs Tools for extracting and analyzing JavaScript files from web pages 40
s0rg/crawley A utility for systematically extracting URLs from web pages and printing them to the console. 263
spider-rs/spider A web crawler and scraper built on top of Rust, designed to extract data from the web in a flexible and configurable manner. 1,140
yhat/scrape A collection of utility functions and tools to simplify web scraping in Go. 1,513
spekulatius/phpscraper A web scraping utility for PHP that simplifies the process of extracting information from websites. 536
elixir-crawly/crawly A framework for extracting structured data from websites 987
felipecsl/wombat A Ruby-based web crawler and data extraction tool with an elegant DSL. 1,315
the-markup/blacklight-collector A tool for scraping website content and analyzing browser behavior 202
stewartmckee/cobweb A flexible web crawler that can be used to extract data from websites in a scalable and efficient manner 226
propublica/upton A web scraping framework that simplifies the process by handling repetitive tasks and provides options for efficient data retrieval 1,613
joncanning/skyscraper A framework for building asynchronous web scrapers and crawlers using async/await and Reactive Extensions. 58
jaimeiniesta/metainspector A Ruby gem for web scraping and extracting metadata from web pages. 1,036