arrow
Data interchange toolkit
A toolkit for efficient data interchange and in-memory analytics in various languages
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
15k stars
351 watching
4k forks
Language: C++
last commit: 1 day ago
Linked from 3 awesome lists
arrow
Related projects:
Repository | Description | Stars |
---|---|---|
duckdb/arrow | An extension for DuckDB that enables support for Apache Arrow, providing functions to serialize tables and scan arrow ipc buffers. | 34 |
wesm/feather | A binary data frame storage system that enables efficient and interoperable data sharing across multiple programming languages. | 2,742 |
apache/datafusion-ballista-python | Bindings for using Apache Arrow's query engine in Python to analyze and manipulate large datasets | 33 |
apache/orc | A columnar storage format for Hadoop workloads that optimizes data access and reduces query performance requirements. | 692 |
android10/arrow | A collection of reusable utility classes and methods for Java and Android development | 439 |
apache/datafusion-ballista | Distributed query engine for Apache DataFusion applications | 1,551 |
lancedb/lance | A modern columnar data format for machine learning and large language models. | 3,956 |
apache/fury | A high-performance serialization framework that supports multiple languages and protocols, designed to improve data transfer and object persistence in distributed systems. | 3,112 |
apache/datafusion | A query engine that supports various data formats and allows customization of its functionality. | 6,340 |
apache/iceberg | Enables reliable and simple access to huge analytic tables across multiple engines | 6,494 |
apache/camel | An integration framework for connecting and integrating various systems and data sources | 5,575 |
jthomasmock/arrow-dplyr | A unified data processing framework combining the expressive power of Arrow with the simplicity and efficiency of Dplyr. | 38 |
apache/avro | A data serialization system that enables efficient and flexible storage of structured data in various programming languages | 2,952 |
lancedb/lancedb | A serverless vector search and storage database built with Rust, enabling efficient similarity searches across multimodal data. | 4,757 |