kimuraframework

Scraping framework

A web scraping framework for extracting data from JavaScript-rendered websites.

Kimurai is a modern web scraping framework written in Ruby which works out of box with Headless Chromium/Firefox, PhantomJS, or simple HTTP requests and allows to scrape and interact with JavaScript rendered websites

GitHub

1k stars

30 watching

154 forks

Language: Ruby

last commit: about 2 years ago

Linked from 1 awesome list

crawlerheadless-chromekimuraiscraperscrapy

Backlinks from these awesome lists:

markets/awesome-ruby

Related projects:

Repository	Description	Stars
zhuyingda/webster	A framework for automating web scraping and crawling tasks using Node.js	518
matiasb/demiurge	An HTML scraping framework built on top of PyQuery.	115
wardrop/scorched	A light-weight web framework for Ruby that aims to provide generic yet powerful constructs for processing HTTP requests.	275
joncanning/skyscraper	A framework for building asynchronous web scrapers and crawlers using async/await and Reactive Extensions.	59
slotix/dataflowkit	A framework for extracting structured data from web pages using CSS selectors.	667
kballenegger/kenji	A lightweight Ruby web framework for building backend APIs with clean routing and modular code organization.	32
kenichi/angelo	A DSL for building web applications with real-time capabilities using Ruby and Reel's reactor	301
guilleiguaran/nancy	A Ruby web framework that provides a minimalistic approach to building web applications with features like routing, session management and template rendering.	82
joseconstela/webparsy	A Node.js library and CLI for scraping websites using Puppeteer and YAML definitions	44
propublica/upton	A web scraping framework that simplifies the process by handling repetitive tasks and provides options for efficient data retrieval	1,612
jjelosua/doga_scraper	A tool that extracts and converts Galician Official journal documents to different formats based on input year.	0
komarserjio/notejam	A unified sample web application to facilitate learning of various server-side frameworks.	1,145
miyagawa/web-scraper	A Perl toolkit for extracting structured data from HTML documents using a DSL-like interface.	104
kimeiga/bahunya	A CSS framework that uses semantic HTML to provide a set of pre-defined UI elements without using explicit class names.	298
oscarotero/embed	A PHP library to retrieve metadata and embed code from any web page	2,100