sparkit-learn
Machine Learning Library
A Python library that integrates PySpark and scikit-learn for distributed machine learning
PySpark + Scikit-learn = Sparkit-learn
1k stars
89 watching
255 forks
Language: Python
Last commit: almost 4 years ago
Linked from 2 awesome lists
apache-spark, distributed-computing, machine-learning, python, scikit-learn
Related projects:
Repository | Description | Stars |
---|---|---|
dmmiller612/sparktorch | A PyTorch implementation on Apache Spark for distributed deep learning model training and inference. | 339 |
scikit-learn/scikit-learn | A comprehensive Python module for machine learning built on top of SciPy | 60,136 |
amueller/scipy_2015_sklearn_tutorial | Tutorials and materials for learning machine learning with Python using popular libraries like scikit-learn. | 576 |
gmonce/scikit-learn-book | Source code and data for a machine learning book with Python tutorials | 393 |
sparklyr/sparklyr | An R interface to Apache Spark for distributed data analysis and machine learning | 957 |
gaelvaroquaux/scikit-learn-tutorial | A tutorial on applying machine learning to practical situations using the scikit-learn library | 129 |
flint-bot/sparky | Provides a NodeJS API to interact with the Cisco Spark platform | 16 |
lightning-universe/lightning-bolts | Provides a toolbox of components to extend PyTorch Lightning for deep learning research and production | 1,693 |
josephreisinger/vowpal_porpoise | Lightweight machine learning library with interface to scikit-learn and vowpal_wabbit | 166 |
kaiyangzhou/dassl.pytorch | A PyTorch toolbox for supporting research and development of domain adaptation, generalization, and semi-supervised learning methods in computer vision. | 1,217 |
lehy/ocaml-sklearn | Enables machine learning with scikit-learn in OCaml | 34 |
visenger/handson-ml | Teaches Machine Learning fundamentals in Python using Scikit-Learn and TensorFlow | 6 |
scikit-learn-contrib/metric-learn | A Python library providing efficient implementations of various supervised and weakly-supervised metric learning algorithms. | 1,399 |
cstjean/scikitlearn.jl | A Julia implementation of popular machine learning algorithms and interfaces. | 544 |
tubular/sparkly | A set of Python libraries and tools to simplify interactions with various data sources using Apache Spark. | 60 |