sparkit-learn

Machine Learning Library

A Python library that integrates PySpark and scikit-learn for distributed machine learning

PySpark + Scikit-learn = Sparkit-learn

GitHub

1k stars
89 watching
255 forks
Language: Python
last commit: almost 4 years ago
Linked from 2 awesome lists

apache-sparkdistributed-computingmachine-learningpythonscikit-learn

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
dmmiller612/sparktorch A PyTorch implementation on Apache Spark for distributed deep learning model training and inference. 339
scikit-learn/scikit-learn A comprehensive Python module for machine learning built on top of SciPy 60,136
amueller/scipy_2015_sklearn_tutorial Tutorials and materials for learning machine learning with Python using popular libraries like scikit-learn. 576
gmonce/scikit-learn-book Source code and data for a machine learning book with Python tutorials 393
sparklyr/sparklyr An R interface to Apache Spark for distributed data analysis and machine learning 957
gaelvaroquaux/scikit-learn-tutorial A tutorial on applying machine learning to practical situations using the scikit-learn library 129
flint-bot/sparky Provides a NodeJS API to interact with the Cisco Spark platform 16
lightning-universe/lightning-bolts Provides a toolbox of components to extend PyTorch Lightning for deep learning research and production 1,693
josephreisinger/vowpal_porpoise Lightweight machine learning library with interface to scikit-learn and vowpal_wabbit 166
kaiyangzhou/dassl.pytorch A PyTorch toolbox for supporting research and development of domain adaptation, generalization, and semi-supervised learning methods in computer vision. 1,217
lehy/ocaml-sklearn Enables machine learning with scikit-learn in OCaml 34
visenger/handson-ml Teaches Machine Learning fundamentals in Python using Scikit-Learn and TensorFlow 6
scikit-learn-contrib/metric-learn A Python library providing efficient implementations of various supervised and weakly-supervised metric learning algorithms. 1,399
cstjean/scikitlearn.jl A Julia implementation of popular machine learning algorithms and interfaces. 544
tubular/sparkly A set of Python libraries and tools to simplify interactions with various data sources using Apache Spark. 60