scala-scraper

HTML parser DSL

A Scala library that provides a domain-specific language (DSL) for parsing and extracting content from HTML pages.

A Scala library for scraping content from HTML pages

GitHub

717 stars
29 watching
106 forks
Language: Scala
last commit: 14 days ago
dslhacktoberfesthtml-parsingscalascraper

Related projects:

Repository Description Stars
bplawler/crawler A Scala-based DSL for programmatically accessing and interacting with web pages 148
felipecsl/wombat A Ruby-based web crawler and data extraction tool with an elegant DSL. 1,315
sparsetech/pine A functional HTML5 and XML library for the Scala platform, providing type-safe bindings and rendering capabilities. 107
miyagawa/web-scraper A Perl toolkit for extracting structured data from HTML documents using a DSL-like interface. 104
epfl-lara/scallion A library for parsing and pretty-printing LL(1) languages in Scala. 56
phenoscape/scowl A Scala DSL for creating OWL expressions and axioms using the OWL API 56
scrapy/scrapely A pure-python library for extracting structured data from HTML pages. 1,863
egonschiele/handsomesoup A Haskell library that simplifies HTML parsing by providing CSS selectors and attribute extraction functions. 124
ndmitchell/tagsoup A Haskell library for parsing and extracting information from HTML/XML documents 233
fimad/scalpel A web scraping library providing a declarative interface on top of an HTML parsing library to extract data from HTML pages 323
opal/paggio A Ruby DSL to generate HTML and CSS with support for Sinatra integration, Markdown rendering, Opal script execution, and local method calling. 42
lambdaworks/scurl-detector Detects and extracts URLs from text in written content 16
benibela/xidel A tool to extract data from web pages using various query languages and selectors. 681
thoughtworksinc/dsl.scala A framework to create embedded Domain-Specific Languages in Scala 254
scalafx/scalafx A UI DSL written in Scala that simplifies the creation of JavaFX-based user interfaces 675