iceberg

Table engine

Enables reliable and simple access to huge analytic tables across multiple engines

Apache Iceberg

GitHub

7k stars
164 watching
2k forks
Language: Java
Last commit: about 1 month ago
Linked from 2 awesome lists

apache, hacktoberfest, iceberg

Related projects:

Repository | Description | Stars
apache/ignite | A distributed, in-memory database system for high-performance computing and data processing | 4,834
dbeaver/dbeaver | A multi-platform tool for connecting to and managing various databases | 40,942
apache/shardingsphere | A distributed SQL query and transaction engine for sharding, scaling, encryption, and more on any database | 20,034
apache/arrow | A toolkit for efficient data interchange and in-memory analytics in various languages | 14,728
apache/kylin | An OLAP engine designed to handle Big Data with sub-second query latency and seamless integration with BI tools | 3,661
apache/incubator-hugegraph | A fast and scalable graph database for storing and querying billions of vertices and edges | 2,663
apache/hudi | A platform for storing and managing big data in cloud storage, enabling incremental processing and optimized querying of large datasets | 5,498
apache/hive | A software project that enables data warehousing and management of large datasets using SQL | 5,577
apache/datafusion | A query engine that supports various data formats and allows customization of its functionality | 6,462
apache/flink | An open-source stream processing framework with powerful capabilities for handling high-throughput and low-latency data streams in various programming languages | 24,261
apache/pinot | A distributed real-time analytics system with low latency | 5,562
apache/fury | A high-performance serialization framework that supports multiple languages and protocols, designed to improve data transfer and object persistence in distributed systems | 3,133
erikgrinaker/toydb | A distributed, educational, in-memory relational database project built with Rust to demonstrate its architecture and concepts | 6,271
lindb/lindb | A high-performance, distributed time series database with horizontal scalability and high availability | 3,010
jtablesaw/tablesaw | A Java library for data manipulation and visualization | 3,564