shc
HBase connector
A Spark connector for accessing HBase as an external data source or sink with optimized support for DataFrame and DataSet operations
The Apache Spark - Apache HBase Connector is a library to support Spark accessing HBase table as external data source or sink.
553 stars
426 watching
280 forks
Language: Scala
last commit: over 3 years ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
apache/hbase-connectors | Connects Apache HBase with other systems like Kafka and Spark for data processing and integration. | 236 |
apache/flink-connector-hbase | A connector for integrating HBase with the Apache Flink stream processing framework | 30 |
liuyq-617/td-spark | A Java project that reads from and writes to TDengine using Apache Spark. | 0 |
datastax/spark-cassandra-connector | A library that enables integration between Apache Spark and Apache Cassandra for fast data processing and analysis. | 1,944 |
orientechnologies/spark-orientdb | A Spark module that enables data integration with OrientDB using its native SQL and document-based data model | 19 |
analysys/presto-hbase-connector | A Presto connector that enables querying HBase data sets. | 240 |
irvingc/dbscan-on-spark | An implementation of the DBSCAN clustering algorithm on top of Apache Spark | 184 |
yaooqinn/itachi | A library that brings useful functions from various modern database management systems to Apache Spark | 56 |
mongodb/mongo-spark | A Java-based connector for integrating MongoDB with Apache Spark | 713 |
neo4j/neo4j-spark-connector | Provides bi-directional access to Neo4j graph data using Apache Spark | 313 |
basho/spark-riak-connector | A Spark connector that enables integration with Riak KV and Riak TS datastores | 60 |
mraad/spark-gdb | A library that enables Apache Spark to read and query Esri File Geodatabases | 25 |
apache/spark | An analytics engine designed to handle large-scale data processing and analysis | 40,170 |
cascading/cascading.hbase | A Cascading module providing HBase data access and integration | 10 |
dotnet/spark | Provides high-performance APIs for using Apache Spark with .NET | 2,032 |