datafusion-ballista
Query engine
Distributed query engine for Apache DataFusion applications
Apache DataFusion Ballista Distributed Query Engine
2k stars
51 watching
197 forks
Language: Rust
last commit: 1 day ago
Linked from 1 awesome list
arrowbig-datadataframedistributedolappythonquery-enginerustsql
Related projects:
Repository | Description | Stars |
---|---|---|
apache/datafusion-ballista-python | Bindings for using Apache Arrow's query engine in Python to analyze and manipulate large datasets | 33 |
fedomn/sqlrs | An embedded in-process SQL query engine designed to handle OLAP workloads using Rust and Apache Arrow. | 109 |
apache/datafusion-python | A Python library that provides a data processing and querying framework using the Apache Arrow in-memory query engine. | 377 |
apache/druid | A high-performance real-time analytics database for fast queries and ingest | 13,523 |
cloudera/impala | A distributed SQL query engine for analyzing large datasets in Hadoop clusters | 33 |
paradedb/pg_analytics | Enables direct querying of large data volumes from Postgres using a high-performance analytical query engine | 380 |
swirrl/matcha | An in-memory graph query engine with a SPARQL-like DSL for querying Linked Data Models | 21 |
apache/drill | A distributed query layer for Hadoop and NoSQL data storage systems, supporting various query languages. | 1,948 |
apache/spark | An analytics engine designed to handle large-scale data processing and analysis | 40,002 |
dalmatinerdb/dqe | A distributed, in-memory query engine built on top of Erlang, designed to handle high-performance data processing and analytics tasks | 10 |
apache/tez | A system that enables flexible data processing pipelines using a low-level engine for higher-level frameworks | 480 |
protegeproject/sparql-dl-api | A query engine for a specific query language used in semantic web applications | 12 |
callidon/sparql-engine | A framework for building query engines on top of various data storage systems using the SPARQL query language | 100 |
apache/impala | A high-performance query engine designed to handle large-scale data processing and analytics | 1,152 |
npgall/cqengine | A high-performance Java collection that enables fast and efficient querying of data using SQL-like syntax | 1,724 |