druid

Data processing engine

A high-performance real-time analytics database for fast queries and ingest

Apache Druid: a high-performance real-time analytics database.

GitHub

14k stars
584 watching
4k forks
Language: Java
Last commit: 4 days ago
Linked from 6 awesome lists



Related projects:

| Repository | Description | Stars |
| --- | --- | --- |
| apache/pig | Enables data processing and transformation in large files using a high-level language with compile-time optimizations for efficient execution on distributed computing frameworks | 682 |
| apache/spark | An analytics engine designed to handle large-scale data processing and analysis | 40,170 |
| apache/tez | A system that enables flexible data processing pipelines using a low-level engine for higher-level frameworks | 482 |
| apache/datafusion-ballista | Distributed query engine for Apache DataFusion applications | 1,580 |
| apache/systemds | An end-to-end data science platform that integrates data integration, machine learning model training, and deployment | 1,038 |
| asavinov/bistro | A general-purpose data analysis engine that processes batch and stream data in a column-oriented manner | 7 |
| apache/impala | A high-performance query engine designed to handle large-scale data processing and analytics | 1,164 |
| ivelum/djangoql | An advanced search library for Django models with auto-completion and support for logical operators and table joins | 1,025 |
| allegro/turnilo | A web application providing a user-friendly interface to explore and visualize data in Apache Druid | 733 |
| apache/sedona | A software framework that enables developers to process spatial data at any scale within modern cluster computing systems | 1,974 |
| apache/samza | A distributed stream processing framework for handling high-volume data streams with fault tolerance and durability guarantees | 817 |
| h2oai/h2o-2 | An analytics engine that provides fast and scalable predictive modeling capabilities for big data | 2,224 |
| webdb-app/app | A comprehensive database IDE with features like versioning and data inference, designed to simplify database development and management | 192 |
| cloudera/impala | A distributed SQL query engine for analyzing large datasets in Hadoop clusters | 34 |
| dalmatinerdb/dqe | A distributed, in-memory query engine built on top of Erlang, designed to handle high-performance data processing and analytics tasks | 10 |