sparkling-water

ML integration

Integrates H2O's machine learning capabilities with Apache Spark for big data processing and analytics

Sparkling Water provides H2O functionality inside Spark cluster

GitHub

968 stars
180 watching
359 forks
Language: Scala
last commit: about 2 months ago
Linked from 2 awesome lists

big-datah2ointegrationmachine-learningpysparkpysparklingrsparklingscalaspark

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
h2oai/h2o-3 An in-memory machine learning platform that supports various algorithms and provides tools for building, deploying, and scaling machine learning models 6,950
h2oai/h2o-tutorials Provides tutorials and training materials for machine learning with H2O, a platform for building predictive models. 1,484
h2oai/h2o-2 An analytics engine that provides fast and scalable predictive modeling capabilities for big data 2,224
h2oai/h2o-flow An interactive computing environment for machine learning and data analysis 134
internetarchive/sparkling A data processing library built on top of Apache Spark to handle temporal web data 11
hydrospheredata/mist A platform for deploying and managing Spark applications in a serverless environment 326
apache/spark An analytics engine designed to handle large-scale data processing and analysis 40,170
hypjudy/sparkles Develops multimodal instruction-following models for open-ended dialogues across multiple images 43
hydrospheredata/hydro-serving A MLOps platform for deploying and versioning machine learning models in production. 271
lensacom/sparkit-learn A Python library that integrates PySpark and scikit-learn for distributed machine learning 1,154
spiritlab/spark A research-focused implementation of Apache Spark with homomorphic encryption support 3
sparklyr/sparklyr An R interface to Apache Spark for distributed data analysis and machine learning 955
h2oai/mli-resources Provides tools and techniques for interpreting machine learning models 483
kotlin/kotlin-spark-api Provides compatibility and extensions between Kotlin and Apache Spark for big data processing 463
gorillalabs/sparkling A Clojure API for interacting with Apache Spark 448