distributed-dataset
Parallel data processor
A Haskell-based framework for processing and distributing large datasets across multiple nodes in parallel.
A distributed data processing framework in Haskell.
116 stars
13 watching
5 forks
Language: Haskell
last commit: over 4 years ago aws-lambdadata-processingdistributedhaskellspark
Related projects:
Repository | Description | Stars |
---|---|---|
| An environment for efficient data processing using Haskell or R code. | 587 |
| A lightweight Python implementation of Spark's RDD and DStream interfaces for improved performance on small datasets | 262 |
| Enables backpropagation in distributed settings and facilitates model parallelism using differentiable communication between processes | 62 |
| A high-performance platform for large-scale R data processing and analytics | 163 |
| A framework for handling and transforming streaming data in a consistent and efficient way | 903 |
| A set of commands for high-speed processing of large-scale CSV data | 33 |
| Provides a data structure and framework for monotonically-growing concurrent programs | 80 |
| A distributed computing framework for building fault-tolerant and redundant applications | 347 |
| An implementation of Cloud Haskell's core libraries for building distributed systems | 713 |
| A tutorial and code samples for parallel and concurrent programming in Haskell. | 219 |
| A distributed stream processing framework for handling high-volume data streams with fault tolerance and durability guarantees | 817 |
| A distributed key-value database system using the Raft consensus protocol | 4 |
| A toolkit for extracting insights from large datasets by parsing and processing semi-structured data | 214 |
| A proof-of-concept implementation of decentralized machine learning on top of the Golem architecture | 43 |
| A modular framework for building secure, composable, and interoperable on-chain applications | 63 |