docker-spark

Spark container

A Docker image with Apache Spark pre-installed and configured for easy deployment on YARN clusters.

GitHub

765 stars
65 watching
282 forks
Language: Shell
Last commit: over 3 years ago
Linked from 1 awesome list


Related projects:

| Repository | Description | Stars |
| --- | --- | --- |
| microsoft/mobius | Provides a C# API for interacting with Apache Spark | 942 |
| janeliascicomp/nextflow-spark | Provides a reusable set of Nextflow subworkflows and processes for creating transient Apache Spark clusters on any infrastructure | 14 |
| dotnet/spark | Provides high-performance APIs for using Apache Spark with .NET | 2,023 |
| databricks/docker-spark-iceberg | A Docker-based environment for running Spark and Iceberg in a quick start scenario | 256 |
| jorgebucaran/spark.fish | A Fish script that displays sparklines in the terminal to visualize data ranges or sequences | 346 |
| ciscove/sparkbundle | Provides an API interface for integrating Cisco Spark services into Symfony2 applications | 1 |
| afeiship/docker-sequenceserver | A Docker container that provides a pre-configured environment for a sequenceserver application | 2 |
| apple/batch-processing-gateway | A tool to simplify running Spark on Kubernetes | 181 |
| tubular/sparkly | A set of Python libraries and tools to simplify interactions with various data sources using Apache Spark | 60 |
| joblib/joblib-spark | Enables parallelization of machine learning tasks on a distributed Spark cluster using the joblib library | 242 |
| sw1sh/frege-spark | An effort to integrate Apache Spark with the Frege programming language | 5 |
| flint-bot/sparky | Provides a NodeJS API to interact with the Cisco Spark platform | 16 |
| jupyter-incubator/sparkmagic | An open source library that enables interactive development of applications using remote Spark clusters | 1,328 |
| kotlin/kotlin-spark-api | Provides compatibility and extensions between Kotlin and Apache Spark for big data processing | 461 |
| webex/spark-ios-sdk-example-buddies | A sample implementation of a real-time communication application | 3 |