nextflow-spark
Spark wrapper
Provides a reusable set of Nextflow subworkflows and processes for creating transient Apache Spark clusters on any infrastructure.
14 stars
4 watching
3 forks
Language: Nextflow
last commit: about 2 years ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
microsoft/mobius | Provides a C# API for interacting with Apache Spark | 941 |
tubular/sparkly | A set of Python libraries and tools to simplify interactions with various data sources using Apache Spark. | 60 |
gorillalabs/sparkling | A Clojure API for interacting with Apache Spark | 448 |
ondra-m/ruby-spark | A Ruby wrapper around Apache Spark's functionality for large-scale data processing | 227 |
databricks/spark-corenlp | Wraps Stanford CoreNLP annotators as Spark DataFrame functions for natural language processing tasks | 422 |
sequenceiq/docker-spark | A Docker image with Apache Spark pre-installed and configured for easy deployment on YARN clusters. | 765 |
dotnet/spark | Provides high-performance APIs for using Apache Spark with .NET | 2,026 |
jfield44/sparksdkwrapper | Convenience library to embed voice and video calling into an iOS app using the Cisco Spark SDK | 6 |
instaclustr/sample-kafkasparkcassandra | An introductory Scala app using Apache Spark Streaming to process data from Kafka and write summaries to Cassandra. | 23 |
flint-bot/sparky | Provides a NodeJS API to interact with the Cisco Spark platform | 16 |
cloudfroster/react-workflow | A comprehensive React-based SPA boilerplate with various development tools and automation scripts. | 66 |
apache/spark | An analytics engine designed to handle large-scale data processing and analysis | 40,002 |
rougin/spark-plug | A tool that simplifies testing and development with Codeigniter 3 by providing an application instance as a single variable. | 15 |
joblib/joblib-spark | Enables parallelization of machine learning tasks on a distributed Spark cluster using the joblib library. | 242 |
databricks/tensorframes | Enables manipulation of Apache Spark DataFrames using TensorFlow programs | 749 |