arrow

Data interchange toolkit

A toolkit for efficient data interchange and in-memory analytics in various languages

Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics.

GitHub

15k stars
351 watching
4k forks
Language: C++
Last commit: 1 day ago
Linked from 3 awesome lists

Related projects:

Repository | Description | Stars
duckdb/arrow | An extension for DuckDB that enables support for Apache Arrow, providing functions to serialize tables and scan arrow ipc buffers. | 34
wesm/feather | A binary data frame storage system that enables efficient and interoperable data sharing across multiple programming languages. | 2,742
apache/datafusion-ballista-python | Bindings for using Apache Arrow's query engine in Python to analyze and manipulate large datasets | 33
apache/orc | A columnar storage format for Hadoop workloads that optimizes data access and reduces query performance requirements. | 692
android10/arrow | A collection of reusable utility classes and methods for Java and Android development | 439
apache/datafusion-ballista | Distributed query engine for Apache DataFusion applications | 1,551
lancedb/lance | A modern columnar data format for machine learning and large language models. | 3,956
apache/fury | A high-performance serialization framework that supports multiple languages and protocols, designed to improve data transfer and object persistence in distributed systems. | 3,112
apache/datafusion | A query engine that supports various data formats and allows customization of its functionality. | 6,340
apache/iceberg | Enables reliable and simple access to huge analytic tables across multiple engines | 6,494
apache/camel | An integration framework for connecting and integrating various systems and data sources | 5,575
jthomasmock/arrow-dplyr | A unified data processing framework combining the expressive power of Arrow with the simplicity and efficiency of Dplyr. | 38
apache/avro | A data serialization system that enables efficient and flexible storage of structured data in various programming languages | 2,952
lancedb/lancedb | A serverless vector search and storage database built with Rust, enabling efficient similarity searches across multimodal data. | 4,757