ferret

Data extractor

A web scraping system that simplifies data extraction from the web using a declarative language and abstracts away technical complexities.

Declarative web scraping

GitHub

6k stars
101 watching
304 forks
Language: Go
last commit: 3 months ago
Linked from 1 awesome list

cdpchromeclicrawlercrawlingdata-miningdslgogolanghacktoberfestlibraryquery-languagescraperscrapingscraping-websitestool

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
dbalmain/ferret An information retrieval library designed to be extensible and compatible with Apache Lucene. 279
noaa-pmel/ferret A software tool for data visualization and analysis from NOAA's Pacific Marine Environmental Laboratory. 55
jkraemer/ferret A C-based information retrieval library providing an extensible framework for indexing and querying data 22
fimad/scalpel A web scraping library providing a declarative interface on top of an HTML parsing library to extract data from HTML pages 325
felipecsl/wombat A Ruby-based web crawler and data extraction tool with an elegant DSL. 1,315
tidyverse/rvest A package for extracting data from web pages using HTML parsing and CSS/XPath selectors. 1,495
malfrats/xeuledoc A tool to fetch information about public Google documents from various services 856
propublica/upton A web scraping framework that simplifies the process by handling repetitive tasks and provides options for efficient data retrieval 1,612
naufalardhani/domhttpx A tool to discover and extract information from web pages using HTTP requests and Google search queries. 68
spider-rs/spider A tool for web data extraction and processing using Rust 1,234
slotix/dataflowkit A framework for extracting structured data from web pages using CSS selectors. 667
benibela/xidel A tool to extract data from web pages using various query languages and selectors. 690
holgerd77/django-dynamic-scraper An app that allows you to manage Scrapy spiders through a Django admin interface. 1,155
gregorut/vgchartzscrape A Python script that captures data from vgchartz.com and saves it to a CSV file 80
yhat/scrape A collection of utility functions and tools to simplify web scraping in Go. 1,513