dbscan-on-spark

Clustering algorithm

An implementation of the DBSCAN clustering algorithm on top of Apache Spark

An implementation of DBSCAN runing on top of Apache Spark

GitHub

184 stars
19 watching
58 forks
Language: Scala
last commit: almost 7 years ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
tugdualsarazin/spark-clustering Implementations of clustering algorithms using Spark in Scala 18
spark-clustering-notebook/g-stream An implementation of data stream clustering algorithms using Spark Streaming. 3
apache/spark An analytics engine designed to handle large-scale data processing and analysis 40,002
dotnet/spark Provides high-performance APIs for using Apache Spark with .NET 2,026
yaooqinn/itachi A library that brings useful functions from various modern database management systems to Apache Spark 56
iralabdisco/pso-clustering An algorithm for unsupervised machine learning tasks involving grouping similar data points into clusters. 68
databricks/spark-csv A library for parsing and querying CSV data with Apache Spark 1,053
khadidjam/dc-dpm A Distributed Clustering algorithm based on Dirichlet Process Mixture Model using Apache Spark 4
instaclustr/sample-kafkasparkcassandra An introductory Scala app using Apache Spark Streaming to process data from Kafka and write summaries to Cassandra. 23
spiritlab/spark A research-focused implementation of Apache Spark with homomorphic encryption support 3
nchammas/flintrock A command-line tool for launching and managing Apache Spark clusters on AWS 638
datastax/spark-cassandra-connector A library that enables integration between Apache Spark and Apache Cassandra for fast data processing and analysis. 1,942
hortonworks-spark/shc A Spark connector for accessing HBase as an external data source or sink with optimized support for DataFrame and DataSet operations 552
shahsohil/dcc An implementation of Deep Continuous Clustering algorithms using PyTorch. 209
kotlin/kotlin-spark-api Provides compatibility and extensions between Kotlin and Apache Spark for big data processing 463