sparklyr

Spark interface

An R interface to Apache Spark for distributed data analysis and machine learning

R interface for Apache Spark

GitHub

957 stars
73 watching
310 forks
Language: R
Last commit: 26 days ago
Linked from 1 awesome list

Topics: apache-spark, distributed, dplyr, ide, livy, machine-learning, r, remote-clusters, rstats, spark, sparklyr

Related projects:

Repository | Description | Stars
amplab-extras/sparkr-pkg | Provides a lightweight R interface to Apache Spark for data processing | 641
ondra-m/ruby-spark | A Ruby wrapper around Apache Spark's functionality for large-scale data processing | 227
apache/incubator-livy | An open-source REST interface for interacting with Apache Spark from anywhere | 889
tubular/sparkly | A set of Python libraries and tools to simplify interactions with various data sources using Apache Spark | 60
sjrusso8/spark-connect-rs | An experimental client for interacting with Apache Spark clusters from Rust | 90
ngmarmaduke/cisco_spark-ruby | Provides a Ruby client for interacting with the Cisco Spark API | 9
ankurchavda/sparklearning | A comprehensive resource for learning Apache Spark, covering its core concepts, components, and advanced topics | 649
apache/spark | An analytics engine designed to handle large-scale data processing and analysis | 40,002
mrpowers-io/spark-fast-tests | A testing helper library for Apache Spark applications | 436
dotnet/spark | Provides high-performance APIs for using Apache Spark with .NET | 2,026
harryprince/geospark | Enables spatial analysis in Apache Spark using SF functions | 57
dmmiller612/sparktorch | A PyTorch implementation on Apache Spark for distributed deep learning model training and inference | 339
gorillalabs/sparkling | A Clojure API for interacting with Apache Spark | 448
lensacom/sparkit-learn | A Python library that integrates PySpark and scikit-learn for distributed machine learning | 1,155
kcrandall/emr_spark_automation | Automates deployment of an AWS EMR cluster and execution of Spark jobs | 8