hive

Data warehouse tool

A software project that enables data warehousing and management of large datasets using SQL

Apache Hive

GitHub

6k stars
326 watching
5k forks
Language: Java
Last commit: about 1 month ago
Linked from 2 awesome lists

apache, big-data, database, hadoop, hive, java, sql

Related projects:

Repository | Description | Stars
apache/hudi | A platform for storing and managing big data in cloud storage, enabling incremental processing and optimized querying of large datasets | 5,498
apache/shardingsphere | A distributed SQL query and transaction engine for sharding, scaling, encryption, and more on any database | 20,034
apache/kyuubi | An Apache project providing a distributed and multi-tenant gateway to enable serverless SQL on data warehouses and lakehouses | 2,116
apache/kylin | An OLAP engine designed to handle Big Data with sub-second query latency and seamless integration with BI tools | 3,661
apache/cassandra | A highly scalable, partitioned row store that allows flexible data distribution and organization | 8,906
apache/hbase | A distributed, versioned, column-oriented store designed to scale and manage large amounts of structured data | 5,246
apache/datafusion | A query engine that supports various data formats and allows customization of its functionality | 6,462
dbeaver/dbeaver | A multi-platform tool for connecting to and managing various databases | 40,942
crate/crate | A distributed and scalable SQL database for storing and analyzing massive amounts of data in near real-time | 4,139
apache/ignite | A distributed, in-memory database system for high-performance computing and data processing | 4,834
apache/drill | A distributed query layer for Hadoop and NoSQL data storage systems, supporting various query languages | 1,949
apache/arrow | A toolkit for efficient data interchange and in-memory analytics in various languages | 14,728
apache/datafusion-ballista | Distributed query engine for Apache DataFusion applications | 1,580
apache/iotdb | A time-series data management system for industrial IoT applications | 5,651
apache/tez | A system that enables flexible data processing pipelines using a low-level engine for higher-level frameworks | 482