spark-cassandra-connector

Data processor

A library that enables integration between Apache Spark and Apache Cassandra for fast data processing and analysis.

DataStax Connector for Apache Spark to Apache Cassandra

GitHub

2k stars
163 watching
922 forks
Language: Scala
last commit: 3 months ago
Linked from 3 awesome lists

cassandrascalaspark

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
datastax/spark-cassandra-stress A tool for testing the performance and stability of data integration between Apache Spark and Cassandra databases. 25
apache/spark An analytics engine designed to handle large-scale data processing and analysis 40,066
datastax/csharp-driver A C# client library for interacting with Apache Cassandra databases 640
datastax/python-driver A Python client library for interacting with Apache Cassandra databases 1,393
datastax/cpp-driver A C++ client library for Apache Cassandra 404
svenkreiss/pysparkling A lightweight Python implementation of Spark's RDD and DStream interfaces for improved performance on small datasets 262
datastax/nodejs-driver A Node.js client library for interacting with Apache Cassandra databases 1,242
datastax/php-driver A PHP client library for interacting with Apache Cassandra databases 435
instaclustr/sample-kafkasparkcassandra An introductory Scala app using Apache Spark Streaming to process data from Kafka and write summaries to Cassandra. 23
hortonworks-spark/shc A Spark connector for accessing HBase as an external data source or sink with optimized support for DataFrame and DataSet operations 553
databricks/spark-xml A library that parses and queries XML data in Apache Spark 504
datastax/cass-operator Automates deployment and management of Apache Cassandra clusters on Kubernetes 257
microsoft/mobius Provides a C# API for interacting with Apache Spark 941
dotnet/spark Provides high-performance APIs for using Apache Spark with .NET 2,031
indix/sparkplug A Spark-based package to apply data fixes using rule-based SQL conditions 28