ruby-readability

Content extractor

A tool for extracting readable content from web pages written in Ruby.

Port of arc90's readability project to Ruby

GitHub

925 stars
34 watching
171 forks
Language: Ruby
last commit: 3 months ago

Related projects:

Repository Description Stars
talyssonoc/commonregexruby Extracts common information from text strings in various formats 79
jacopotarantino/preloadables Provides preloading and prefetching metadata helpers for Rails applications to speed up page loading 18
philipjkim/goreadability Extracts readable content from web pages using Open Graph and traditional readability rules. 69
aymericbeaumet/squeeze A tool to extract relevant information from text 17
tjatse/node-readability Automates web page scraping and text extraction to make any webpage readable 343
jonmagic/grim A tool for extracting pages from PDFs and converting them to images and text strings. 216
joenas/readability.cr Port of readability project to Crystal programming language 12
keepcosmos/readability An Elixir library that extracts and curates primary readable content from web pages. 252
yomurb/yomu A Ruby library for extracting text and metadata from various file formats. 499
wooorm/readability A tool that calculates and visualizes the readability of written text based on various factors. 205
jaimeiniesta/metainspector A Ruby gem for web scraping and extracting metadata from web pages. 1,036
rrrene/inch Analyzes and suggests improvements to inline documentation in Ruby codebases 518
rubycocos/csvreader A gem for reading tabular data in the comma-separated values (CSV) format 178
felipecsl/wombat A Ruby-based web crawler and data extraction tool with an elegant DSL. 1,315
komposable/komponent A Ruby on Rails gem for organizing front-end code into reusable components 427