druid

Data processing engine

A high-performance real-time analytics database for fast queries and ingest

Apache Druid: a high performance real-time analytics database.

GitHub

14k stars
585 watching
4k forks
Language: Java
last commit: 6 days ago
Linked from 6 awesome lists

druid

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
apache/pig Enables data processing and transformation in large files using a high-level language with compile-time optimizations for efficient execution on distributed computing frameworks. 681
apache/spark An analytics engine designed to handle large-scale data processing and analysis 39,916
apache/tez A system that enables flexible data processing pipelines using a low-level engine for higher-level frameworks 479
apache/datafusion-ballista A distributed SQL query engine built on Apache Arrow and Rust, designed to provide efficient columnar processing and low memory usage. 1,544
apache/systemds An end-to-end data science platform that integrates data integration, machine learning model training, and deployment 1,035
asavinov/bistro A general-purpose data analysis engine that processes batch and stream data in a column-oriented manner 7
apache/impala A high-performance query engine designed to handle large-scale data processing and analytics 1,151
ivelum/djangoql An advanced search language for Django that supports logical operators, table joins, and auto-completion. 1,011
allegro/turnilo A web application providing a user-friendly interface to explore and visualize data in Apache Druid 729
apache/sedona An open-source spatial computing engine for processing large-scale geospatial data in cluster environments 1,955
apache/samza A distributed stream processing framework for handling high-volume data streams with fault tolerance and durability guarantees 820
h2oai/h2o-2 An analytics engine that provides fast and scalable predictive modeling capabilities for big data 2,224
webdb-app/app A comprehensive database IDE with features like versioning and data inference, designed to simplify database development and management. 181
cloudera/impala A distributed SQL query engine for analyzing large datasets in Hadoop clusters 33
dalmatinerdb/dqe A distributed, in-memory query engine built on top of Erlang, designed to handle high-performance data processing and analytics tasks 10