druid

Data processing engine

A high-performance real-time analytics database for fast queries and ingest

Apache Druid: a high-performance real-time analytics database.

GitHub

14k stars

584 watching

4k forks

Language: Java

last commit: over 1 year ago

Linked from 6 awesome lists

druid.apache.org/

Related projects:

Repository	Description	Stars
apache/pig	Enables data processing and transformation in large files using a high-level language with compile-time optimizations for efficient execution on distributed computing frameworks.	682
apache/spark	An analytics engine designed to handle large-scale data processing and analysis	40,170
apache/tez	A system that enables flexible data processing pipelines using a low-level engine for higher-level frameworks	482
apache/datafusion-ballista	Distributed query engine for Apache DataFusion applications	1,580
apache/systemds	An end-to-end data science platform that combines data integration, machine learning model training, and deployment	1,038
asavinov/bistro	A general-purpose data analysis engine that processes batch and stream data in a column-oriented manner	7
apache/impala	A high-performance query engine designed to handle large-scale data processing and analytics	1,164
ivelum/djangoql	An advanced search library for Django models with auto-completion and support for logical operators and table joins.	1,025
allegro/turnilo	A web application providing a user-friendly interface to explore and visualize data in Apache Druid	733
apache/sedona	A software framework that enables developers to process spatial data at any scale within modern cluster computing systems.	1,974
apache/samza	A distributed stream processing framework for handling high-volume data streams with fault tolerance and durability guarantees	817
h2oai/h2o-2	An analytics engine that provides fast and scalable predictive modeling capabilities for big data	2,224
webdb-app/app	A comprehensive database IDE with features like versioning and data inference, designed to simplify database development and management.	192
cloudera/impala	A distributed SQL query engine for analyzing large datasets in Hadoop clusters	34
dalmatinerdb/dqe	A distributed, in-memory query engine built on top of Erlang, designed to handle high-performance data processing and analytics tasks	10