arrow
Data interchange toolkit
A toolkit for efficient data interchange and in-memory analytics in various languages
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
15k stars
352 watching
4k forks
Language: C++
last commit: about 1 month ago
Linked from 3 awesome lists
arrow
Related projects:
Repository | Description | Stars |
---|---|---|
duckdb/arrow | An extension for DuckDB that enables support for Apache Arrow, providing functions to serialize tables and scan arrow ipc buffers. | 36 |
wesm/feather | A binary data frame storage system that enables efficient and interoperable data sharing across multiple programming languages. | 2,742 |
apache/datafusion-ballista-python | Bindings for using Apache Arrow's query engine in Python to analyze and manipulate large datasets | 34 |
apache/orc | A columnar storage format for Hadoop workloads that optimizes data access and reduces query performance requirements. | 698 |
android10/arrow | A collection of reusable utility classes and methods for Java and Android development | 439 |
apache/datafusion-ballista | Distributed query engine for Apache DataFusion applications | 1,580 |
lancedb/lance | A modern columnar data format for machine learning and large language models. | 4,010 |
apache/fury | A high-performance serialization framework that supports multiple languages and protocols, designed to improve data transfer and object persistence in distributed systems. | 3,133 |
apache/datafusion | A query engine that supports various data formats and allows customization of its functionality. | 6,462 |
apache/iceberg | Enables reliable and simple access to huge analytic tables across multiple engines | 6,621 |
apache/camel | An integration framework for connecting and integrating various systems and data sources | 5,603 |
jthomasmock/arrow-dplyr | A unified data processing framework combining the expressive power of Arrow with the simplicity and efficiency of Dplyr. | 38 |
apache/avro | A data serialization system that enables efficient and flexible storage of structured data in various programming languages | 2,973 |
lancedb/lancedb | A serverless vector search and storage database built with Rust, enabling efficient similarity searches across multimodal data. | 4,993 |