datafusion-ballista

Query engine

A distributed SQL query engine built on Apache Arrow and Rust, designed to provide efficient columnar processing and low memory usage.

Apache DataFusion Ballista Distributed Query Engine

GitHub

2k stars
52 watching
196 forks
Language: Rust
last commit: 9 days ago
Linked from 1 awesome list

arrowbig-datadataframedistributedolappythonquery-enginerustsql

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
apache/datafusion-ballista-python Bindings for using Apache Arrow's query engine in Python to analyze and manipulate large datasets 33
fedomn/sqlrs An embedded in-process SQL query engine designed to handle OLAP workloads using Rust and Apache Arrow. 109
apache/datafusion-python A Python library that provides a data processing and querying framework using the Apache Arrow in-memory query engine. 375
apache/druid A high-performance real-time analytics database for fast queries and ingest 13,513
cloudera/impala A distributed SQL query engine for analyzing large datasets in Hadoop clusters 33
paradedb/pg_analytics Enables direct querying of large data volumes from Postgres using a high-performance analytical query engine 380
swirrl/matcha An in-memory graph query engine with a SPARQL-like DSL for querying Linked Data Models 21
apache/drill A distributed query layer for Hadoop and NoSQL data storage systems, supporting various query languages. 1,945
apache/spark An analytics engine designed to handle large-scale data processing and analysis 39,916
dalmatinerdb/dqe A distributed, in-memory query engine built on top of Erlang, designed to handle high-performance data processing and analytics tasks 10
apache/tez A system that enables flexible data processing pipelines using a low-level engine for higher-level frameworks 479
protegeproject/sparql-dl-api A query engine for a specific query language used in semantic web applications 12
callidon/sparql-engine A framework for building query engines on top of various data storage systems using the SPARQL query language 100
apache/impala A high-performance query engine designed to handle large-scale data processing and analytics 1,151
npgall/cqengine A high-performance Java collection that enables fast and efficient querying of data using SQL-like syntax 1,722