datafusion-ballista

Query engine

Distributed query engine for Apache DataFusion applications

Apache DataFusion Ballista Distributed Query Engine

GitHub

2k stars
50 watching
198 forks
Language: Rust
last commit: about 1 month ago
Linked from 1 awesome list

arrowbig-datadataframedistributedolappythonquery-enginerustsql

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
apache/datafusion-ballista-python Bindings for using Apache Arrow's query engine in Python to analyze and manipulate large datasets 34
fedomn/sqlrs An embedded in-process SQL query engine designed to handle OLAP workloads using Rust and Apache Arrow. 109
apache/datafusion-python A Python library that provides a data processing and querying framework using the Apache Arrow in-memory query engine. 385
apache/druid A high-performance real-time analytics database for fast queries and ingest 13,548
cloudera/impala A distributed SQL query engine for analyzing large datasets in Hadoop clusters 34
paradedb/pg_analytics Enables direct querying of data lakes from Postgres without moving data to a cloud data warehouse 407
swirrl/matcha An in-memory graph query engine with a SPARQL-like DSL for querying Linked Data Models 22
apache/drill A distributed query layer for Hadoop and NoSQL data storage systems, supporting various query languages. 1,949
apache/spark An analytics engine designed to handle large-scale data processing and analysis 40,170
dalmatinerdb/dqe A distributed, in-memory query engine built on top of Erlang, designed to handle high-performance data processing and analytics tasks 10
apache/tez A system that enables flexible data processing pipelines using a low-level engine for higher-level frameworks 482
protegeproject/sparql-dl-api A query engine for a specific query language used in semantic web applications 12
callidon/sparql-engine A framework for building query engines on top of various data storage systems using the SPARQL query language 101
apache/impala A high-performance query engine designed to handle large-scale data processing and analytics 1,164
npgall/cqengine A high-performance Java collection that enables fast and efficient querying of data using SQL-like syntax 1,728