ferret

Data extractor

A web scraping system that simplifies data extraction from the web using a declarative language and abstracts away technical complexities.

Declarative web scraping

GitHub

6k stars

101 watching

304 forks

Language: Go

last commit: over 1 year ago

Linked from 1 awesome list

cdpchromeclicrawlercrawlingdata-miningdslgogolanghacktoberfestlibraryquery-languagescraperscrapingscraping-websitestool

www.montferret.dev/

Backlinks from these awesome lists:

brucedone/awesome-crawler

Related projects:

Repository	Description	Stars
dbalmain/ferret	An information retrieval library designed to be extensible and compatible with Apache Lucene.	279
noaa-pmel/ferret	A software tool for data visualization and analysis from NOAA's Pacific Marine Environmental Laboratory.	55
jkraemer/ferret	A C-based information retrieval library providing an extensible framework for indexing and querying data	22
fimad/scalpel	A web scraping library providing a declarative interface on top of an HTML parsing library to extract data from HTML pages	325
felipecsl/wombat	A Ruby-based web crawler and data extraction tool with an elegant DSL.	1,315
tidyverse/rvest	A package for extracting data from web pages using HTML parsing and CSS/XPath selectors.	1,495
malfrats/xeuledoc	A tool to fetch information about public Google documents from various services	856
propublica/upton	A web scraping framework that simplifies the process by handling repetitive tasks and provides options for efficient data retrieval	1,612
naufalardhani/domhttpx	A tool to discover and extract information from web pages using HTTP requests and Google search queries.	68
spider-rs/spider	A tool for web data extraction and processing using Rust	1,234
slotix/dataflowkit	A framework for extracting structured data from web pages using CSS selectors.	667
benibela/xidel	A tool to extract data from web pages using various query languages and selectors.	690
holgerd77/django-dynamic-scraper	An app that allows you to manage Scrapy spiders through a Django admin interface.	1,155
gregorut/vgchartzscrape	A Python script that captures data from vgchartz.com and saves it to a CSV file	80
yhat/scrape	A collection of utility functions and tools to simplify web scraping in Go.	1,513