sparkplug
Data fixer
A Spark-based package to apply data fixes using rule-based SQL conditions
Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌
28 stars
7 watching
2 forks
Language: Scala
last commit: almost 5 years ago datapipelinesparkspark-sql
Related projects:
Repository | Description | Stars |
---|---|---|
| A library that parses and queries XML data in Apache Spark | 504 |
| An analytics engine designed to handle large-scale data processing and analysis | 40,170 |
| A library that enables integration between Apache Spark and Apache Cassandra for fast data processing and analysis. | 1,944 |
| A library of reusable code for building scalable Spark applications | 19 |
| A tool for testing the performance and stability of data integration between Apache Spark and Cassandra databases. | 25 |
| A library for parsing and querying CSV data with Apache Spark | 1,052 |
| An R interface to Apache Spark for distributed data analysis and machine learning | 955 |
| Provides high-performance APIs for using Apache Spark with .NET | 2,032 |
| A PHP library that maps database tables into objects with automated relationship management and validation. | 51 |
| A testing helper library for Apache Spark applications. | 437 |
| A set of Python libraries and tools to simplify interactions with various data sources using Apache Spark. | 61 |
| A tool that simplifies testing and development with Codeigniter 3 by providing an application instance as a single variable. | 15 |
| An extension for Google Refine that adds columns to reconciled data from DBpedia | 39 |
| Provides tools to read data from Solr and write it to Spark DataFrames/RDDs, enabling integration with Solr. | 445 |
| A command-line interface to Cisco Spark | 14 |