sparkling-water

ML integration

Integrates H2O's machine learning capabilities with Apache Spark for big data processing and analytics

Sparkling Water provides H2O functionality inside Spark cluster

GitHub

968 stars
180 watching
360 forks
Language: Scala
last commit: 6 days ago
Linked from 2 awesome lists

big-datah2ointegrationmachine-learningpysparkpysparklingrsparklingscalaspark

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
h2oai/h2o-3 An in-memory machine learning platform that supports various algorithms and provides tools for building, deploying, and scaling machine learning models 6,922
h2oai/h2o-tutorials Provides tutorials and training materials for machine learning with H2O, a platform for building predictive models. 1,483
h2oai/h2o-2 An analytics engine that provides fast and scalable predictive modeling capabilities for big data 2,224
h2oai/h2o-flow An interactive computing environment for machine learning and data analysis 133
internetarchive/sparkling A data processing library built on top of Apache Spark to handle temporal web data 11
hydrospheredata/mist A platform for deploying and managing Spark applications in a serverless environment 326
apache/spark An analytics engine designed to handle large-scale data processing and analysis 39,916
hypjudy/sparkles Develops multimodal instruction-following models for open-ended dialogues across multiple images 41
hydrospheredata/hydro-serving A MLOps platform for deploying and versioning machine learning models in production. 271
lensacom/sparkit-learn A Python library that integrates PySpark and scikit-learn for distributed machine learning 1,154
spiritlab/spark A research-focused implementation of Apache Spark with homomorphic encryption support 3
sparklyr/sparklyr An R interface to Apache Spark for distributed data analysis and machine learning 957
h2oai/mli-resources Provides tools and techniques for interpreting machine learning models 484
kotlin/kotlin-spark-api Provides compatibility and extensions between Kotlin and Apache Spark for big data processing 461
gorillalabs/sparkling A Clojure API for interacting with Apache Spark 448