rvest
Web scraper
A package for extracting data from web pages using HTML parsing and CSS/XPath selectors.
Simple web scraping for R
1k stars
88 watching
343 forks
Language: R
Last commit: 27 days ago
Linked from 2 awesome lists
Topics: html, r, web-scraping
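Example:
A minimal sketch of the CSS and XPath selection the description mentions. It assumes rvest 1.0 or later is installed; the inline page, the `pkg` class, and the variable names are hypothetical and chosen only for illustration, so the snippet runs without network access.

```r
library(rvest)

# Hypothetical inline page (not a real site), built with minimal_html()
# so that no HTTP request is needed.
page <- minimal_html('
  <h1>Packages</h1>
  <ul>
    <li class="pkg"><a href="https://rvest.tidyverse.org">rvest</a></li>
    <li class="pkg"><a href="https://xml2.r-lib.org">xml2</a></li>
  </ul>
')

# CSS selector: every link inside a list item with class "pkg".
links_css <- html_elements(page, "li.pkg a")
pkg_names <- html_text2(links_css)

# XPath selector: the same links, this time reading the href attribute.
links_xpath <- html_elements(page, xpath = "//li[@class='pkg']/a")
pkg_urls <- html_attr(links_xpath, "href")

print(pkg_names)
print(pkg_urls)
```

For a live page, `read_html()` with a URL would take the place of `minimal_html()`; the selector calls stay the same.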
Related projects:
Repository | Description | Stars |
---|---|---|
yusuzech/r-web-scraping-cheat-sheet | A comprehensive guide to web scraping using R, covering various techniques and libraries. | 385 |
spider-rs/spider | A web crawler and scraper built on top of Rust, designed to extract data from the web in a flexible and configurable manner. | 1,140 |
tidyverse/tidyr | Helps create organized data with minimal manual work | 1,384 |
tidyverse/reprex | Tool to create and render reproducible R code snippets for sharing | 741 |
tidyverse/readr | Provides fast and friendly way to read rectangular data from delimited files | 1,010 |
tidyverse/tidyverse | A set of R packages designed to work together harmoniously for data science tasks. | 1,669 |
jaimeiniesta/metainspector | A Ruby gem for web scraping and extracting metadata from web pages. | 1,036 |
rust-scraper/scraper | A Rust library for parsing and querying HTML documents using CSS selectors. | 1,937 |
miyagawa/web-scraper | A Perl toolkit for extracting structured data from HTML documents using a DSL-like interface. | 104 |
slotix/dataflowkit | A framework for extracting structured data from web pages using CSS selectors. | 662 |
scrapy/scrapely | A pure-python library for extracting structured data from HTML pages. | 1,863 |
zhuyingda/webster | A framework for automating web scraping and crawling tasks using Node.js | 515 |
tidyverse/stringr | A set of functions for efficiently manipulating strings in R | 605 |
felipecsl/wombat | A Ruby-based web crawler and data extraction tool with an elegant DSL. | 1,315 |
benibela/xidel | A tool to extract data from web pages using various query languages and selectors. | 681 |