RHadoop

Big data toolkit

A collection of reusable components and libraries for big data processing and analysis

RHadoop

GitHub

763 stars
154 watching
277 forks
last commit: about 9 years ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
google-research/rlds A toolkit for storing and manipulating episodic data in reinforcement learning and related tasks. 302
gopherdata/resources A collection of Go-based resources and tools for data science tasks 879
rmcelreath/statrethinking_winter2019 A repository of statistical modeling materials and resources for an R course on Bayesian inference 2,018
rdong08/spatialdwls_dataset Provides code and data for a spatial decision-making tool 12
drewrwilson/toolsforactivism A curated list of digital tools for activism and social change 973
jdidion/biotools A collection of bioinformatics tools and resources organized by topic and category 590
rjt1990/pydata2016-sanfrancisco An analysis of time series methods using PyFlux library and incorporating NFL prediction model. 23
reanahub/reana A platform for structuring and reusing research data analysis workflows in a reproducible manner 127
benmarwick/rrtools Tools for creating reproducible research projects in R using Quarto and version control. 680
ropensci/drake Tools and infrastructure to streamline workflow management and reproducibility in R-based data science projects. 1,343
saberma/ruby-dev-bookmarks A curated list of Ruby development resources and tools 414
braverock/performanceanalytics A package of econometric functions for analyzing financial performance and risk 211
basilesimon/datajournalists-toolbox A collection of curated tools and resources for datajournalists to analyze and visualize their data 43
google-research/deep_ope Provides benchmarking policies and datasets for offline reinforcement learning 85
chen1649chenli/dataopsresource A curated list of resources and tools for managing data operations in cultural heritage organizations 24