spark-clustering
Clustering algorithms library
Implementations of clustering algorithms using Spark in Scala
Some Spark implementations of clustering algorithms.
18 stars
6 watching
8 forks
Language: Scala
last commit: almost 7 years ago Related projects:
| Repository | Description | Stars |
|---|---|---|
| | An implementation of data stream clustering algorithms using Spark Streaming. | 3 |
| | An implementation of the DBSCAN clustering algorithm on top of Apache Spark | 184 |
| | A library that parses and queries XML data in Apache Spark | 504 |
| | Enables manipulation of Apache Spark DataFrames using TensorFlow programs | 749 |
| | Provides compatibility and extensions between Kotlin and Apache Spark for big data processing | 463 |
| | Implementation of k-modes and k-prototypes clustering algorithms in Go. | 43 |
| | A Spark plugin that registers metrics from operational system resources | 0 |
| | Enables parallelization of machine learning tasks on a distributed Spark cluster using the joblib library. | 243 |
| | A collection of clustering algorithms implemented in JavaScript | 30 |
| | An effort to integrate Apache Spark with the Frege programming language | 5 |
| | This repository provides implementations of various clustering and subspace clustering algorithms in MATLAB, including K-means, ISODATA, Mean Shift, DBSCAN, Gaussian Mixture Model, LVQ, Subspace Clustering Algorithms like Subspace K-means and Entropy-Weighting Subspace K-means. | 227 |
| | An implementation of a clustering algorithm using Particle Swarm Optimization (PSO), specifically designed to group similar data points together. | 68 |
| | Provides implementations of various multi-view spectral clustering algorithms for data analysis and visualization. | 87 |
| | Wraps Stanford CoreNLP annotators as Spark DataFrame functions for natural language processing tasks | 422 |
| | A highly optimized time series library for Apache Spark | 1,006 |