shc

HBase connector

A Spark connector for accessing HBase as an external data source or sink with optimized support for DataFrame and DataSet operations

The Apache Spark - Apache HBase Connector is a library to support Spark accessing HBase table as external data source or sink.

GitHub

553 stars
426 watching
280 forks
Language: Scala
last commit: over 3 years ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
apache/hbase-connectors Connects Apache HBase with other systems like Kafka and Spark for data processing and integration. 236
apache/flink-connector-hbase A connector for integrating HBase with the Apache Flink stream processing framework 30
liuyq-617/td-spark A Java project that reads from and writes to TDengine using Apache Spark. 0
datastax/spark-cassandra-connector A library that enables integration between Apache Spark and Apache Cassandra for fast data processing and analysis. 1,944
orientechnologies/spark-orientdb A Spark module that enables data integration with OrientDB using its native SQL and document-based data model 19
analysys/presto-hbase-connector A Presto connector that enables querying HBase data sets. 240
irvingc/dbscan-on-spark An implementation of the DBSCAN clustering algorithm on top of Apache Spark 184
yaooqinn/itachi A library that brings useful functions from various modern database management systems to Apache Spark 56
mongodb/mongo-spark A Java-based connector for integrating MongoDB with Apache Spark 713
neo4j/neo4j-spark-connector Provides bi-directional access to Neo4j graph data using Apache Spark 313
basho/spark-riak-connector A Spark connector that enables integration with Riak KV and Riak TS datastores 60
mraad/spark-gdb A library that enables Apache Spark to read and query Esri File Geodatabases 25
apache/spark An analytics engine designed to handle large-scale data processing and analysis 40,170
cascading/cascading.hbase A Cascading module providing HBase data access and integration 10
dotnet/spark Provides high-performance APIs for using Apache Spark with .NET 2,032