lambdasoup

HTML scraper

A functional library for scraping and rewriting HTML with CSS selectors in OCaml

GitHub

384 stars
12 watching
31 forks
Language: OCaml
Last commit: 2 months ago
Linked from 1 awesome list

Tags: css, html, ocaml, scraping, soup

Related projects:

Repository | Description | Stars
aantron/markup.ml | A streaming HTML5 and XML parser that detects character encodings, emits signals, and provides error recovery | 146
fcannizzaro/jsoup-annotations | A Java library that provides annotations to simplify HTML scraping and processing with Jsoup | 239
miyagawa/web-scraper | A Perl toolkit for extracting structured data from HTML documents using a DSL-like interface | 104
meilisearch/docs-scraper | Automates scraping and indexing of documentation content into a search engine | 297
pharo-contributions/soup | An HTML parsing and scraping library for Pharo | 6
egonschiele/handsomesoup | A Haskell library that simplifies HTML parsing by providing CSS selectors and attribute extraction functions | 123
oscarotero/embed | A PHP library to retrieve metadata and embed code from any web page | 2,100
felipecsl/wombat | A Ruby-based web crawler and data extraction tool with an elegant DSL | 1,315
bendeaton/abaqus-documentation-scraper | Extracts keywords and parameters from Abaqus documentation for a syntax highlighting plugin | 4
jakopako/goskyr | A tool to simplify web scraping of list-like structured data from web pages | 36
the-markup/blacklight-collector | A tool for scraping website content and analyzing browser behavior | 205
malfrats/xeuledoc | A tool to fetch information about public Google documents from various services | 856
scrapy/scrapely | A pure-Python library for extracting structured data from HTML pages | 1,865
michaelhelmick/lassie | A library for retrieving basic content from websites | 615
laramies/metagoofil | Extracts metadata from public documents found on websites, useful for brute-force attacks | 1,050