lambdasoup

HTML scraper

A functional HTML scraping and manipulation library in OCaml

Functional HTML scraping and rewriting with CSS in OCaml

384 stars

12 watching

31 forks

Language: OCaml

last commit: over 1 year ago

Linked from 1 awesome list

csshtmlocamlscrapingsoup

Screenshot of aantron/lambdasoup website

aantron.github.io/lambdasoup

Backlinks from these awesome lists:

ocaml-community/awesome-ocaml

Related projects:

Repository	Description	Stars
aantron/markup.ml	A streaming HTML5 and XML parser that detects character encodings, emits signals, and provides error recovery.	146
fcannizzaro/jsoup-annotations	A Java library that provides annotations to simplify HTML scraping and processing with Jsoup	239
miyagawa/web-scraper	A Perl toolkit for extracting structured data from HTML documents using a DSL-like interface.	104
meilisearch/docs-scraper	Automates scraping and indexing of documentation content into a search engine	297
pharo-contributions/soup	An HTML parsing and scraping library for Pharo	6
egonschiele/handsomesoup	A Haskell library that simplifies HTML parsing by providing CSS selectors and attribute extraction functions.	123
oscarotero/embed	A PHP library to retrieve metadata and embed code from any web page	2,100
felipecsl/wombat	A Ruby-based web crawler and data extraction tool with an elegant DSL.	1,315
bendeaton/abaqus-documentation-scraper	Extracts keywords and parameters from Abaqus documentation for syntax highlighting plugin	4
jakopako/goskyr	A tool to simplify web scraping of list-like structured data from web pages	36
the-markup/blacklight-collector	A tool for scraping website content and analyzing browser behavior	205
malfrats/xeuledoc	A tool to fetch information about public Google documents from various services	856
scrapy/scrapely	A pure-python library for extracting structured data from HTML pages.	1,865
michaelhelmick/lassie	Library for retrieving basic content from websites	615
laramies/metagoofil	Extracts metadata from public documents found on websites, useful for brute-force attacks.	1,050