bistro

Data analytics engine

A general-purpose data analysis engine that processes batch and stream data in a column-oriented manner

A general-purpose data analysis engine radically changing the way batch and stream data is processed

GitHub

7 stars
2 watching
0 forks
Language: Java
last commit: about 6 years ago
Linked from 3 awesome lists

analyticsbig-data-analyticsedge-analyticsiotstream-analyticsstream-processing

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
h2oai/h2o-2 An analytics engine that provides fast and scalable predictive modeling capabilities for big data 2,224
asavinov/lambdo A workflow engine that unifies feature engineering and machine learning operations for data analysis. 23
apache/spark An analytics engine designed to handle large-scale data processing and analysis 39,916
jsoftware/jsource Provides a high-level programming language and runtime environment for statistical and logical analysis of data 662
apache/druid A high-performance real-time analytics database for fast queries and ingest 13,513
abistarun/resseract-lite An application tool for visualizing and analyzing data 4
johnsonc/lambdo A workflow engine for unifying feature engineering and machine learning operations in data analysis pipelines 1
samsara/samsara A real-time analytics platform built on Clojure that processes IoT data streams and generates actionable insights. 147
apache/tez A system that enables flexible data processing pipelines using a low-level engine for higher-level frameworks 479
analysiscenter/batchflow A framework for defining and executing data processing and machine learning workflows with support for batch processing, lazy execution, and model training. 201
apache/samza A distributed stream processing framework for handling high-volume data streams with fault tolerance and durability guarantees 820
stanfordhci/datavore A small JavaScript database engine designed to support fast aggregation queries in web-based analytics and visualization applications. 248
apache/datafusion-ballista A distributed SQL query engine built on Apache Arrow and Rust, designed to provide efficient columnar processing and low memory usage. 1,544
bcgov/fasstr An R package to analyze and visualize streamflow data. 55
zavtech/morpheus-core A high-performance data analysis library for large-scale JVM applications with support for parallel processing and scalable data structures. 238