sparta
Real-time analytics platform
A real-time analytics platform built on Apache Spark and Kafka, allowing users to process large datasets in near-real time using declarative workflows.
Real Time Analytics and Data Pipelines based on Spark Streaming
525 stars
137 watching
196 forks
Language: Scala
last commit: about 5 years ago analyticshdfskafkalambdaolapreal-timescalasparkspark-streamingsparksqlspartastratiostratio-spartastreamingstreaming-datatriggersworkflow
Related projects:
Repository | Description | Stars |
---|---|---|
strat0sphere/spark-euca | Provides scripts to deploy multiple big data tools in a managed environment using Eucalyptus and Amazon AWS | 1 |
apache/spark | An analytics engine designed to handle large-scale data processing and analysis | 39,916 |
opensoc/opensoc | A centralized platform for security monitoring and analysis utilizing open-source big data technologies to integrate log aggregation, packet capture indexing, advanced analytics, and threat intelligence. | 572 |
dotnet/spark | Provides high-performance APIs for using Apache Spark with .NET | 2,023 |
gazette/core | Enables teams to build platforms mixing SQL, batch, and real-time streaming processing paradigms | 718 |
samsara/samsara | A real-time analytics platform built on Clojure that processes IoT data streams and generates actionable insights. | 147 |
stamusnetworks/suricata-analytics | Provides resources and tools for analyzing Suricata data | 27 |
instaclustr/sample-kafkasparkcassandra | An introductory Scala app using Apache Spark Streaming to process data from Kafka and write summaries to Cassandra. | 23 |
rwalk/straw | A platform for real-time streaming search that supports scalable Lucene query capabilities and configuration management | 103 |
sryza/spark-timeseries | A comprehensive library for time series analysis on Apache Spark using Scala and other libraries. | 1,194 |
stamusnetworks/kts | Customizable dashboards and visualizations for security monitoring and analysis using Suricata IDPS and the ELK stack. | 33 |
sirthias/swave | A toolkit for building high-performance streaming applications in Scala | 171 |
glacials/splits-io | A speedrunning data store and analysis engine that enables runners to improve through data analysis. | 133 |
hydrospheredata/mist | A platform for deploying and managing Spark applications in a serverless environment | 326 |
yahoo/panoptes-stream | A distributed streaming network telemetry system | 40 |