DC-DPM

Clustering algorithm

A Distributed Clustering algorithm based on Dirichlet Process Mixture Model using Apache Spark

GitHub

4 stars
1 watching
0 forks
Language: Scala
last commit: over 5 years ago

Related projects:

Repository Description Stars
navdeep-g/h2o3-pam An implementation of the PAM algorithm in H2O-3 for clustering data. 1
shahsohil/dcc An implementation of Deep Continuous Clustering algorithms using PyTorch. 209
irvingc/dbscan-on-spark An implementation of the DBSCAN clustering algorithm on top of Apache Spark 184
iralabdisco/pso-clustering An implementation of a clustering algorithm using Particle Swarm Optimization (PSO), specifically designed to group similar data points together. 68
thedatumorg/kshape-matlab A Matlab implementation of an efficient and accurate unsupervised method for clustering time-series data 53
kalvar/ios-krkmeans-algorithm K-Means clustering algorithm implementation with multi-dimensional support and customizable features 23
xuyxu/clustering This repository provides implementations of various clustering and subspace clustering algorithms in MATLAB, including K-means, ISODATA, Mean Shift, DBSCAN, Gaussian Mixture Model, LVQ, Subspace Clustering Algorithms like Subspace K-means and Entropy-Weighting Subspace K-means. 227
ymjs-irfan/dp-fedsam This repository provides an implementation of a differentially private federated learning algorithm designed to improve the robustness and performance of federated machine learning systems. 42
thupchnsky/mufc An implementation of a federated clustering approach with an unlearning mechanism to improve data privacy and security in distributed machine learning 19
alexgkendall/optics_clustering A MATLAB implementation of an unsupervised clustering algorithm that groups data points based on their density and reachability distances 58
lmcinnes/hdbscan An algorithm for clustering data based on density and distance relationships 94
tugdualsarazin/spark-clustering Implementations of clustering algorithms using Spark in Scala 18
navdeep-g/h2o3-gapstat An implementation of the Gap Statistic method to determine optimal cluster counts in datasets 1
spark-clustering-notebook/g-stream An implementation of data stream clustering algorithms using Spark Streaming. 3
kalvar/ios-krfuzzycmeans-algorithm An implementation of Fuzzy C-Means clustering algorithm with fuzzy theory on Machine Learning for data mining and image compression. 12