druid

Data processing engine

A high-performance real-time analytics database for fast queries and ingest

Apache Druid: a high performance real-time analytics database.

GitHub

14k stars
585 watching
4k forks
Language: Java
last commit: about 14 hours ago
Linked from 6 awesome lists

druid

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
apache/pig Enables data processing and transformation in large files using a high-level language with compile-time optimizations for efficient execution on distributed computing frameworks. 681
apache/spark An analytics engine designed to handle large-scale data processing and analysis 40,002
apache/tez A system that enables flexible data processing pipelines using a low-level engine for higher-level frameworks 480
apache/datafusion-ballista Distributed query engine for Apache DataFusion applications 1,551
apache/systemds An end-to-end data science platform that integrates data integration, machine learning model training, and deployment 1,036
asavinov/bistro A general-purpose data analysis engine that processes batch and stream data in a column-oriented manner 7
apache/impala A high-performance query engine designed to handle large-scale data processing and analytics 1,152
ivelum/djangoql An advanced search language for Django that supports logical operators, table joins, and auto-completion. 1,011
allegro/turnilo A web application providing a user-friendly interface to explore and visualize data in Apache Druid 730
apache/sedona An engine for processing spatial data at any scale within cluster computing systems 1,960
apache/samza A distributed stream processing framework for handling high-volume data streams with fault tolerance and durability guarantees 819
h2oai/h2o-2 An analytics engine that provides fast and scalable predictive modeling capabilities for big data 2,224
webdb-app/app A comprehensive database IDE with features like versioning and data inference, designed to simplify database development and management. 181
cloudera/impala A distributed SQL query engine for analyzing large datasets in Hadoop clusters 33
dalmatinerdb/dqe A distributed, in-memory query engine built on top of Erlang, designed to handle high-performance data processing and analytics tasks 10