sparkly

Spark wrapper

A set of Python libraries and tools to simplify interactions with various data sources using Apache Spark.

Helpers & syntactic sugar for PySpark.

60 stars
41 watching
9 forks
Language: Python
Last commit: over 1 year ago
Linked from 1 awesome list

Tags: pyspark, python, spark

Related projects:

| Repository | Description | Stars |
| --- | --- | --- |
| brbester/pyciscospark | Provides an interface to the Cisco Spark REST API | 30 |
| gorillalabs/sparkling | A Clojure API for interacting with Apache Spark | 448 |
| svenkreiss/pysparkling | A lightweight Python implementation of Spark's RDD and DStream interfaces for improved performance on small datasets | 262 |
| janeliascicomp/nextflow-spark | Provides a reusable set of Nextflow subworkflows and processes for creating transient Apache Spark clusters on any infrastructure | 14 |
| sparklyr/sparklyr | An R interface to Apache Spark for distributed data analysis and machine learning | 957 |
| microsoft/mobius | Provides a C# API for interacting with Apache Spark | 941 |
| jupyter-incubator/sparkmagic | An open source library that enables interactive development of applications using remote Spark clusters | 1,331 |
| amplab-extras/sparkr-pkg | Provides a lightweight R interface to Apache Spark for data processing | 641 |
| ondra-m/ruby-spark | A Ruby wrapper around Apache Spark's functionality for large-scale data processing | 227 |
| sciotaio/micropython-sparkplugb | An implementation of the Eclipse Sparkplug B Specification for MicroPython | 10 |
| spiraldb/ziggy-pydust | A toolkit for building Python extensions in Zig | 417 |
| flint-bot/sparky | Provides a NodeJS API to interact with the Cisco Spark platform | 16 |
| ankurchavda/sparklearning | A comprehensive resource for learning Apache Spark, covering its core concepts, components, and advanced topics | 649 |
| hypjudy/sparkles | Develops multimodal instruction-following models for open-ended dialogues across multiple images | 41 |
| lensacom/sparkit-learn | A Python library that integrates PySpark and scikit-learn for distributed machine learning | 1,155 |