scalpel

Web Scraper

A web scraping library providing a declarative interface on top of an HTML parsing library to extract data from HTML pages

A high level web scraping library for Haskell.

GitHub

323 stars
13 watching
43 forks
Language: Haskell
last commit: 8 days ago

Related projects:

Repository Description Stars
miyagawa/web-scraper A Perl toolkit for extracting structured data from HTML documents using a DSL-like interface. 104
bplawler/crawler A Scala-based DSL for programmatically accessing and interacting with web pages 148
skallwar/suckit A Rust-based web scraping tool that recursively visits and downloads websites to disk. 747
benibela/xidel A tool to extract data from web pages using various query languages and selectors. 686
felipecsl/wombat A Ruby-based web crawler and data extraction tool with an elegant DSL. 1,315
propublica/upton A web scraping framework that simplifies the process by handling repetitive tasks and provides options for efficient data retrieval 1,613
ruippeixotog/scala-scraper A Scala library that provides a domain-specific language (DSL) for parsing and extracting content from HTML pages. 717
spekulatius/phpscraper A web scraping utility for PHP that simplifies the process of extracting information from websites. 536
egonschiele/handsomesoup A Haskell library that simplifies HTML parsing by providing CSS selectors and attribute extraction functions. 124
the-markup/blacklight-collector A tool for scraping website content and analyzing browser behavior 202
ndmitchell/tagsoup A Haskell library for parsing and extracting information from HTML/XML documents 233
meilisearch/docs-scraper Automates scraping and indexing of documentation content into a search engine 290
medialab/minet A command line tool and Python library for extracting data from various web sources. 286
yhat/scrape A collection of utility functions and tools to simplify web scraping in Go. 1,513
crufter/haquery Allows HTML manipulation and template rendering using CSS selectors familiar to web developers 65