datafusion-ballista
Query engine
A distributed SQL query engine built on Apache Arrow and Rust, designed to provide efficient columnar processing and low memory usage.
Apache DataFusion Ballista Distributed Query Engine
2k stars
52 watching
196 forks
Language: Rust
last commit: 9 days ago
Linked from 1 awesome list
arrowbig-datadataframedistributedolappythonquery-enginerustsql
Related projects:
Repository | Description | Stars |
---|---|---|
apache/datafusion-ballista-python | Bindings for using Apache Arrow's query engine in Python to analyze and manipulate large datasets | 33 |
fedomn/sqlrs | An embedded in-process SQL query engine designed to handle OLAP workloads using Rust and Apache Arrow. | 109 |
apache/datafusion-python | A Python library that provides a data processing and querying framework using the Apache Arrow in-memory query engine. | 375 |
apache/druid | A high-performance real-time analytics database for fast queries and ingest | 13,513 |
cloudera/impala | A distributed SQL query engine for analyzing large datasets in Hadoop clusters | 33 |
paradedb/pg_analytics | Enables direct querying of large data volumes from Postgres using a high-performance analytical query engine | 380 |
swirrl/matcha | An in-memory graph query engine with a SPARQL-like DSL for querying Linked Data Models | 21 |
apache/drill | A distributed query layer for Hadoop and NoSQL data storage systems, supporting various query languages. | 1,945 |
apache/spark | An analytics engine designed to handle large-scale data processing and analysis | 39,916 |
dalmatinerdb/dqe | A distributed, in-memory query engine built on top of Erlang, designed to handle high-performance data processing and analytics tasks | 10 |
apache/tez | A system that enables flexible data processing pipelines using a low-level engine for higher-level frameworks | 479 |
protegeproject/sparql-dl-api | A query engine for a specific query language used in semantic web applications | 12 |
callidon/sparql-engine | A framework for building query engines on top of various data storage systems using the SPARQL query language | 100 |
apache/impala | A high-performance query engine designed to handle large-scale data processing and analytics | 1,151 |
npgall/cqengine | A high-performance Java collection that enables fast and efficient querying of data using SQL-like syntax | 1,722 |