ferret

Data extractor

A web scraping system that simplifies data extraction from the web using a declarative language and abstracts away technical complexities.

Declarative web scraping

GitHub

6k stars
101 watching
302 forks
Language: Go
last commit: 13 days ago
Linked from 1 awesome list

cdpchromeclicrawlercrawlingdata-miningdslgogolanghacktoberfestlibraryquery-languagescraperscrapingscraping-websitestool

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
dbalmain/ferret A C-based information retrieval library with Ruby bindings that mimics Apache Lucene's functionality. 279
noaa-pmel/ferret A software tool for data visualization and analysis from NOAA's Pacific Marine Environmental Laboratory. 55
jkraemer/ferret An information retrieval library providing an extensible and fast way to search and retrieve data 22
fimad/scalpel A web scraping library providing a declarative interface on top of an HTML parsing library to extract data from HTML pages 323
felipecsl/wombat A Ruby-based web crawler and data extraction tool with an elegant DSL. 1,315
tidyverse/rvest A package for extracting data from web pages using HTML parsing and CSS/XPath selectors. 1,492
malfrats/xeuledoc A tool to fetch information about public Google documents from various services 846
propublica/upton A web scraping framework that simplifies the process by handling repetitive tasks and provides options for efficient data retrieval 1,613
naufalardhani/domhttpx A tool to discover and extract information from web pages using HTTP requests and Google search queries. 68
spider-rs/spider A web crawler and scraper built on top of Rust, designed to extract data from the web in a flexible and configurable manner. 1,140
slotix/dataflowkit A framework for extracting structured data from web pages using CSS selectors. 662
benibela/xidel A tool to extract data from web pages using various query languages and selectors. 681
holgerd77/django-dynamic-scraper An app that allows you to manage Scrapy spiders through a Django admin interface. 1,153
gregorut/vgchartzscrape A Python script that captures data from vgchartz.com and saves it to a CSV file 79
yhat/scrape A collection of utility functions and tools to simplify web scraping in Go. 1,513