hive
Data warehouse tool
A software project that enables data warehousing and management of large datasets using SQL
Apache Hive
6k stars
326 watching
5k forks
Language: Java
Last commit: about 1 month ago
Linked from 2 awesome lists
apache, big-data, database, hadoop, hive, java, sql
Related projects:
Repository | Description | Stars |
---|---|---|
apache/hudi | A platform for storing and managing big data in cloud storage, enabling incremental processing and optimized querying of large datasets | 5,498 |
apache/shardingsphere | A distributed SQL query and transaction engine for sharding, scaling, encryption, and more on any database | 20,034 |
apache/kyuubi | An Apache project providing a distributed and multi-tenant gateway to enable serverless SQL on data warehouses and lakehouses | 2,116 |
apache/kylin | An OLAP engine designed to handle Big Data with sub-second query latency and seamless integration with BI tools. | 3,661 |
apache/cassandra | A highly scalable, partitioned row store that allows flexible data distribution and organization. | 8,906 |
apache/hbase | A distributed, versioned, column-oriented store designed to scale and manage large amounts of structured data | 5,246 |
apache/datafusion | A query engine that supports various data formats and allows customization of its functionality. | 6,462 |
dbeaver/dbeaver | A multi-platform tool for connecting to and managing various databases | 40,942 |
crate/crate | A distributed and scalable SQL database for storing and analyzing massive amounts of data in near real-time. | 4,139 |
apache/ignite | A distributed, in-memory database system for high-performance computing and data processing | 4,834 |
apache/drill | A distributed query layer for Hadoop and NoSQL data storage systems, supporting various query languages. | 1,949 |
apache/arrow | A toolkit for efficient data interchange and in-memory analytics in various languages | 14,728 |
apache/datafusion-ballista | Distributed query engine for Apache DataFusion applications | 1,580 |
apache/iotdb | A time-series data management system for industrial IoT applications | 5,651 |
apache/tez | A system that enables flexible data processing pipelines using a low-level engine for higher-level frameworks | 482 |