druid
Data processing engine
A high-performance real-time analytics database for fast queries and ingest
Apache Druid: a high-performance real-time analytics database.
14k stars
584 watching
4k forks
Language: Java
last commit: 4 days ago
Linked from 6 awesome lists
Related projects:
Repository | Description | Stars |
---|---|---|
apache/pig | Enables data processing and transformation in large files using a high-level language with compile-time optimizations for efficient execution on distributed computing frameworks. | 682 |
apache/spark | An analytics engine designed to handle large-scale data processing and analysis | 40,170 |
apache/tez | A system that enables flexible data processing pipelines using a low-level engine for higher-level frameworks | 482 |
apache/datafusion-ballista | Distributed query engine for Apache DataFusion applications | 1,580 |
apache/systemds | An end-to-end data science platform that combines data integration, machine learning model training, and deployment | 1,038 |
asavinov/bistro | A general-purpose data analysis engine that processes batch and stream data in a column-oriented manner | 7 |
apache/impala | A high-performance query engine designed to handle large-scale data processing and analytics | 1,164 |
ivelum/djangoql | An advanced search library for Django models with auto-completion and support for logical operators and table joins. | 1,025 |
allegro/turnilo | A web application providing a user-friendly interface to explore and visualize data in Apache Druid | 733 |
apache/sedona | A software framework that enables developers to process spatial data at any scale within modern cluster computing systems. | 1,974 |
apache/samza | A distributed stream processing framework for handling high-volume data streams with fault tolerance and durability guarantees | 817 |
h2oai/h2o-2 | An analytics engine that provides fast and scalable predictive modeling capabilities for big data | 2,224 |
webdb-app/app | A comprehensive database IDE with features like versioning and data inference, designed to simplify database development and management. | 192 |
cloudera/impala | A distributed SQL query engine for analyzing large datasets in Hadoop clusters | 34 |
dalmatinerdb/dqe | A distributed, in-memory query engine built on top of Erlang, designed to handle high-performance data processing and analytics tasks | 10 |