sparklyr
Spark interface
An R interface to Apache Spark for distributed data analysis and machine learning
R interface for Apache Spark
955 stars
73 watching
310 forks
Language: R
last commit: 4 months ago
Linked from 1 awesome list
apache-sparkdistributeddplyridelivymachine-learningrremote-clustersrstatssparksparklyr
Related projects:
Repository | Description | Stars |
---|---|---|
| Provides a lightweight R interface to Apache Spark for data processing | 641 |
| A Ruby wrapper around Apache Spark's functionality for large-scale data processing | 227 |
| An open-source REST interface for interacting with Apache Spark from anywhere. | 894 |
| A set of Python libraries and tools to simplify interactions with various data sources using Apache Spark. | 61 |
| An experimental client for interacting with Apache Spark clusters from Rust. | 91 |
| Provides a Ruby client for interacting with the Cisco Spark API | 9 |
| A comprehensive resource for learning Apache Spark, covering its core concepts, components, and advanced topics. | 655 |
| An analytics engine designed to handle large-scale data processing and analysis | 40,170 |
| A testing helper library for Apache Spark applications. | 437 |
| Provides high-performance APIs for using Apache Spark with .NET | 2,032 |
| Enables spatial analysis in Apache Spark using SF functions | 58 |
| A PyTorch implementation on Apache Spark for distributed deep learning model training and inference. | 339 |
| A Clojure API for interacting with Apache Spark | 448 |
| A Python library that integrates PySpark and scikit-learn for distributed machine learning | 1,154 |
| Automates deployment of an AWS EMR cluster and execution of Spark jobs | 8 |