druid
Data processing engine
A high-performance real-time analytics database for fast queries and ingest
Apache Druid: a high performance real-time analytics database.
14k stars
585 watching
4k forks
Language: Java
last commit: about 14 hours ago
Linked from 6 awesome lists
Related projects:
Repository | Description | Stars |
---|---|---|
apache/pig | Enables processing and transformation of large data files using a high-level language, with compile-time optimizations for efficient execution on distributed computing frameworks | 681 |
apache/spark | An analytics engine designed to handle large-scale data processing and analysis | 40,002 |
apache/tez | A system that enables flexible data processing pipelines using a low-level engine for higher-level frameworks | 480 |
apache/datafusion-ballista | Distributed query engine for Apache DataFusion applications | 1,551 |
apache/systemds | An end-to-end data science platform that combines data integration, machine learning model training, and deployment | 1,036 |
asavinov/bistro | A general-purpose data analysis engine that processes batch and stream data in a column-oriented manner | 7 |
apache/impala | A high-performance query engine designed to handle large-scale data processing and analytics | 1,152 |
ivelum/djangoql | An advanced search language for Django that supports logical operators, table joins, and auto-completion | 1,011 |
allegro/turnilo | A web application providing a user-friendly interface to explore and visualize data in Apache Druid | 730 |
apache/sedona | An engine for processing spatial data at any scale within cluster computing systems | 1,960 |
apache/samza | A distributed stream processing framework for handling high-volume data streams with fault tolerance and durability guarantees | 819 |
h2oai/h2o-2 | An analytics engine that provides fast and scalable predictive modeling capabilities for big data | 2,224 |
webdb-app/app | A comprehensive database IDE with features like versioning and data inference, designed to simplify database development and management | 181 |
cloudera/impala | A distributed SQL query engine for analyzing large datasets in Hadoop clusters | 33 |
dalmatinerdb/dqe | A distributed, in-memory query engine written in Erlang, designed for high-performance data processing and analytics | 10 |