datafusion-ballista

Query engine

Distributed query engine for Apache DataFusion applications

Apache DataFusion Ballista Distributed Query Engine

GitHub

2k stars

50 watching

198 forks

Language: Rust

last commit: over 1 year ago

Linked from 1 awesome list

arrowbig-datadataframedistributedolappythonquery-enginerustsql

Screenshot of apache/datafusion-ballista website

datafusion.apache.org/ballista

Backlinks from these awesome lists:

manuzhang/awesome-streaming

Related projects:

Repository	Description	Stars
apache/datafusion-ballista-python	Bindings for using Apache Arrow's query engine in Python to analyze and manipulate large datasets	34
fedomn/sqlrs	An embedded in-process SQL query engine designed to handle OLAP workloads using Rust and Apache Arrow.	109
apache/datafusion-python	A Python library that provides a data processing and querying framework using the Apache Arrow in-memory query engine.	385
apache/druid	A high-performance real-time analytics database for fast queries and ingest	13,548
cloudera/impala	A distributed SQL query engine for analyzing large datasets in Hadoop clusters	34
paradedb/pg_analytics	Enables direct querying of data lakes from Postgres without moving data to a cloud data warehouse	407
swirrl/matcha	An in-memory graph query engine with a SPARQL-like DSL for querying Linked Data Models	22
apache/drill	A distributed query layer for Hadoop and NoSQL data storage systems, supporting various query languages.	1,949
apache/spark	An analytics engine designed to handle large-scale data processing and analysis	40,170
dalmatinerdb/dqe	A distributed, in-memory query engine built on top of Erlang, designed to handle high-performance data processing and analytics tasks	10
apache/tez	A system that enables flexible data processing pipelines using a low-level engine for higher-level frameworks	482
protegeproject/sparql-dl-api	A query engine for a specific query language used in semantic web applications	12
callidon/sparql-engine	A framework for building query engines on top of various data storage systems using the SPARQL query language	101
apache/impala	A high-performance query engine designed to handle large-scale data processing and analytics	1,164
npgall/cqengine	A high-performance Java collection that enables fast and efficient querying of data using SQL-like syntax	1,728