dbscan-on-spark
Clustering algorithm
An implementation of the DBSCAN clustering algorithm on top of Apache Spark
184 stars
19 watching
58 forks
Language: Scala
Last commit: almost 7 years ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
tugdualsarazin/spark-clustering | Implementations of clustering algorithms using Spark in Scala | 18 |
spark-clustering-notebook/g-stream | An implementation of data stream clustering algorithms using Spark Streaming. | 3 |
apache/spark | An analytics engine designed to handle large-scale data processing and analysis | 40,002 |
dotnet/spark | Provides high-performance APIs for using Apache Spark with .NET | 2,026 |
yaooqinn/itachi | A library that brings useful functions from various modern database management systems to Apache Spark | 56 |
iralabdisco/pso-clustering | An algorithm for unsupervised machine learning tasks involving grouping similar data points into clusters. | 68 |
databricks/spark-csv | A library for parsing and querying CSV data with Apache Spark | 1,053 |
khadidjam/dc-dpm | A Distributed Clustering algorithm based on Dirichlet Process Mixture Model using Apache Spark | 4 |
instaclustr/sample-kafkasparkcassandra | An introductory Scala app using Apache Spark Streaming to process data from Kafka and write summaries to Cassandra. | 23 |
spiritlab/spark | A research-focused implementation of Apache Spark with homomorphic encryption support | 3 |
nchammas/flintrock | A command-line tool for launching and managing Apache Spark clusters on AWS | 638 |
datastax/spark-cassandra-connector | A library that enables integration between Apache Spark and Apache Cassandra for fast data processing and analysis. | 1,942 |
hortonworks-spark/shc | A Spark connector for accessing HBase as an external data source or sink with optimized support for DataFrame and DataSet operations | 552 |
shahsohil/dcc | An implementation of Deep Continuous Clustering algorithms using PyTorch. | 209 |
kotlin/kotlin-spark-api | Provides compatibility and extensions between Kotlin and Apache Spark for big data processing | 463 |