ferret
Data extractor
A web scraping system that simplifies data extraction from the web using a declarative language and abstracts away technical complexities.
Declarative web scraping
6k stars
101 watching
302 forks
Language: Go
last commit: 13 days ago
Linked from 1 awesome list
cdpchromeclicrawlercrawlingdata-miningdslgogolanghacktoberfestlibraryquery-languagescraperscrapingscraping-websitestool
Related projects:
Repository | Description | Stars |
---|---|---|
dbalmain/ferret | A C-based information retrieval library with Ruby bindings that mimics Apache Lucene's functionality. | 279 |
noaa-pmel/ferret | A software tool for data visualization and analysis from NOAA's Pacific Marine Environmental Laboratory. | 55 |
jkraemer/ferret | An information retrieval library providing an extensible and fast way to search and retrieve data | 22 |
fimad/scalpel | A web scraping library providing a declarative interface on top of an HTML parsing library to extract data from HTML pages | 323 |
felipecsl/wombat | A Ruby-based web crawler and data extraction tool with an elegant DSL. | 1,315 |
tidyverse/rvest | A package for extracting data from web pages using HTML parsing and CSS/XPath selectors. | 1,492 |
malfrats/xeuledoc | A tool to fetch information about public Google documents from various services | 846 |
propublica/upton | A web scraping framework that simplifies the process by handling repetitive tasks and provides options for efficient data retrieval | 1,613 |
naufalardhani/domhttpx | A tool to discover and extract information from web pages using HTTP requests and Google search queries. | 68 |
spider-rs/spider | A web crawler and scraper built on top of Rust, designed to extract data from the web in a flexible and configurable manner. | 1,140 |
slotix/dataflowkit | A framework for extracting structured data from web pages using CSS selectors. | 662 |
benibela/xidel | A tool to extract data from web pages using various query languages and selectors. | 681 |
holgerd77/django-dynamic-scraper | An app that allows you to manage Scrapy spiders through a Django admin interface. | 1,153 |
gregorut/vgchartzscrape | A Python script that captures data from vgchartz.com and saves it to a CSV file | 79 |
yhat/scrape | A collection of utility functions and tools to simplify web scraping in Go. | 1,513 |