sparta

Real-time analytics platform

A real-time analytics platform built on Apache Spark and Kafka, allowing users to process large datasets in near-real time using declarative workflows.

Real Time Analytics and Data Pipelines based on Spark Streaming

GitHub

525 stars
137 watching
196 forks
Language: Scala
last commit: about 5 years ago
analyticshdfskafkalambdaolapreal-timescalasparkspark-streamingsparksqlspartastratiostratio-spartastreamingstreaming-datatriggersworkflow

Related projects:

Repository Description Stars
strat0sphere/spark-euca Provides scripts to deploy multiple big data tools in a managed environment using Eucalyptus and Amazon AWS 1
apache/spark An analytics engine designed to handle large-scale data processing and analysis 39,916
opensoc/opensoc A centralized platform for security monitoring and analysis utilizing open-source big data technologies to integrate log aggregation, packet capture indexing, advanced analytics, and threat intelligence. 572
dotnet/spark Provides high-performance APIs for using Apache Spark with .NET 2,023
gazette/core Enables teams to build platforms mixing SQL, batch, and real-time streaming processing paradigms 718
samsara/samsara A real-time analytics platform built on Clojure that processes IoT data streams and generates actionable insights. 147
stamusnetworks/suricata-analytics Provides resources and tools for analyzing Suricata data 27
instaclustr/sample-kafkasparkcassandra An introductory Scala app using Apache Spark Streaming to process data from Kafka and write summaries to Cassandra. 23
rwalk/straw A platform for real-time streaming search that supports scalable Lucene query capabilities and configuration management 103
sryza/spark-timeseries A comprehensive library for time series analysis on Apache Spark using Scala and other libraries. 1,194
stamusnetworks/kts Customizable dashboards and visualizations for security monitoring and analysis using Suricata IDPS and the ELK stack. 33
sirthias/swave A toolkit for building high-performance streaming applications in Scala 171
glacials/splits-io A speedrunning data store and analysis engine that enables runners to improve through data analysis. 133
hydrospheredata/mist A platform for deploying and managing Spark applications in a serverless environment 326
yahoo/panoptes-stream A distributed streaming network telemetry system 40