datafusion-ballista

Query engine

Distributed query engine for Apache DataFusion applications

Apache DataFusion Ballista Distributed Query Engine

2k stars
51 watching
197 forks
Language: Rust
Last commit: 1 day ago
Linked from 1 awesome list

Topics: arrow, big-data, dataframe, distributed, olap, python, query-engine, rust, sql

Related projects:

| Repository | Description | Stars |
|---|---|---|
| apache/datafusion-ballista-python | Bindings for using Apache Arrow's query engine in Python to analyze and manipulate large datasets | 33 |
| fedomn/sqlrs | An embedded in-process SQL query engine designed to handle OLAP workloads using Rust and Apache Arrow | 109 |
| apache/datafusion-python | A Python library that provides a data processing and querying framework using the Apache Arrow in-memory query engine | 377 |
| apache/druid | A high-performance real-time analytics database for fast queries and ingest | 13,523 |
| cloudera/impala | A distributed SQL query engine for analyzing large datasets in Hadoop clusters | 33 |
| paradedb/pg_analytics | Enables direct querying of large data volumes from Postgres using a high-performance analytical query engine | 380 |
| swirrl/matcha | An in-memory graph query engine with a SPARQL-like DSL for querying Linked Data Models | 21 |
| apache/drill | A distributed query layer for Hadoop and NoSQL data storage systems, supporting various query languages | 1,948 |
| apache/spark | An analytics engine designed to handle large-scale data processing and analysis | 40,002 |
| dalmatinerdb/dqe | A distributed, in-memory query engine built on top of Erlang, designed to handle high-performance data processing and analytics tasks | 10 |
| apache/tez | A system that enables flexible data processing pipelines using a low-level engine for higher-level frameworks | 480 |
| protegeproject/sparql-dl-api | A query engine for a specific query language used in semantic web applications | 12 |
| callidon/sparql-engine | A framework for building query engines on top of various data storage systems using the SPARQL query language | 100 |
| apache/impala | A high-performance query engine designed to handle large-scale data processing and analytics | 1,152 |
| npgall/cqengine | A high-performance Java collection that enables fast and efficient querying of data using SQL-like syntax | 1,724 |