spark-notebook

Data exploration tool

An interactive web-based editor for exploring and analyzing large datasets using Scala, Apache Spark, and other data science tools

Interactive and Reactive Data Science using Scala and Spark.

GitHub

3k stars
187 watching
651 forks
Language: JavaScript
last commit: almost 3 years ago
Linked from 2 awesome lists

apache-sparkdata-sciencenotebookreactivescalaspark

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
apache/spark An analytics engine designed to handle large-scale data processing and analysis 40,170
instaclustr/sample-kafkasparkcassandra An introductory Scala app using Apache Spark Streaming to process data from Kafka and write summaries to Cassandra. 23
databricks/learning-spark Examples and tutorials for learning Spark using Java and Scala 3,892
databricks/koalas A Python package that allows users to work with pandas DataFrames on top of Apache Spark 3,343
jupyter-incubator/sparkmagic An open source library that enables interactive development of applications using remote Spark clusters 1,334
jerrylead/sparkinternals An in-depth analysis of Apache Spark's design and implementation 5,288
spiritlab/spark A research-focused implementation of Apache Spark with homomorphic encryption support 3
dotnet/spark Provides high-performance APIs for using Apache Spark with .NET 2,032
microsoft/mobius Provides a C# API for interacting with Apache Spark 941
databricks/spark-corenlp Wraps Stanford CoreNLP annotators as Spark DataFrame functions for natural language processing tasks 422
strat0sphere/spark-euca Provides scripts to deploy multiple big data tools in a managed environment using Eucalyptus and Amazon AWS 1
kotlin/kotlin-spark-api Provides compatibility and extensions between Kotlin and Apache Spark for big data processing 463
johnsnowlabs/spark-nlp Provides a set of pre-trained models and libraries for natural language processing tasks on top of Apache Spark 3,889
databricks/docker-spark-iceberg A Docker-based environment for running Spark and Iceberg in a quick start scenario. 264
datastax/spark-cassandra-connector A library that enables integration between Apache Spark and Apache Cassandra for fast data processing and analysis. 1,944