kimuraframework
Scraping framework
A web scraping framework for extracting data from JavaScript-rendered websites.
Kimurai is a modern web scraping framework written in Ruby which works out of box with Headless Chromium/Firefox, PhantomJS, or simple HTTP requests and allows to scrape and interact with JavaScript rendered websites
1k stars
30 watching
154 forks
Language: Ruby
last commit: 10 months ago
Linked from 1 awesome list
crawlerheadless-chromekimuraiscraperscrapy
Related projects:
Repository | Description | Stars |
---|---|---|
| A framework for automating web scraping and crawling tasks using Node.js | 518 |
| An HTML scraping framework built on top of PyQuery. | 115 |
| A light-weight web framework for Ruby that aims to provide generic yet powerful constructs for processing HTTP requests. | 275 |
| A framework for building asynchronous web scrapers and crawlers using async/await and Reactive Extensions. | 59 |
| A framework for extracting structured data from web pages using CSS selectors. | 667 |
| A lightweight Ruby web framework for building backend APIs with clean routing and modular code organization. | 32 |
| A DSL for building web applications with real-time capabilities using Ruby and Reel's reactor | 301 |
| A Ruby web framework that provides a minimalistic approach to building web applications with features like routing, session management and template rendering. | 82 |
| A Node.js library and CLI for scraping websites using Puppeteer and YAML definitions | 44 |
| A web scraping framework that simplifies the process by handling repetitive tasks and provides options for efficient data retrieval | 1,612 |
| A tool that extracts and converts Galician Official journal documents to different formats based on input year. | 0 |
| A unified sample web application to facilitate learning of various server-side frameworks. | 1,145 |
| A Perl toolkit for extracting structured data from HTML documents using a DSL-like interface. | 104 |
| A CSS framework that uses semantic HTML to provide a set of pre-defined UI elements without using explicit class names. | 298 |
| A PHP library to retrieve metadata and embed code from any web page | 2,100 |