spark-xml
XML parser
A library that parses and queries XML data in Apache Spark
XML data source for Spark SQL and DataFrames
504 stars
39 watching
226 forks
Language: Scala
last commit: 6 months ago
Linked from 1 awesome list
Related projects:
| Description | Stars |
|---|---|
| A library for parsing and querying CSV data with Apache Spark | 1,052 |
| An analytics engine designed to handle large-scale data processing and analysis | 40,170 |
| A Spark-based package to apply data fixes using rule-based SQL conditions | 28 |
| Enables manipulation of Apache Spark DataFrames using TensorFlow programs | 749 |
| Wraps Stanford CoreNLP annotators as Spark DataFrame functions for natural language processing tasks | 422 |
| A research-focused implementation of Apache Spark with homomorphic encryption support | 3 |
| A library that enables integration between Apache Spark and Apache Cassandra for fast data processing and analysis | 1,944 |
| A Swift wrapper around XML parsing APIs, providing a simple way to parse XML into a dictionary of arrays | 1,412 |
| Implementations of clustering algorithms using Spark in Scala | 18 |
| Utilities and API for parsing and streaming XML data in Scala | 60 |
| Provides high-performance APIs for using Apache Spark with .NET | 2,032 |
| An XML parsing library implemented in Swift | 584 |
| A Docker-based environment for running Spark and Iceberg in a quick-start scenario | 264 |
| A lightweight Python implementation of Spark's RDD and DStream interfaces for improved performance on small datasets | 262 |
| An implementation of data stream clustering algorithms using Spark Streaming | 3 |