distributed-dataset

Parallel data processor

A Haskell-based framework for processing and distributing large datasets across multiple nodes in parallel.

A distributed data processing framework in Haskell.

GitHub

116 stars
13 watching
5 forks
Language: Haskell
last commit: over 4 years ago
aws-lambdadata-processingdistributedhaskellspark

Related projects:

Repository Description Stars
tweag/haskellr An environment for efficient data processing using Haskell or R code. 587
svenkreiss/pysparkling A lightweight Python implementation of Spark's RDD and DStream interfaces for improved performance on small datasets 262
ag14774/diffdist Enables backpropagation in distributed settings and facilitates model parallelism using differentiable communication between processes 62
vertica/distributedr A high-performance platform for large-scale R data processing and analytics 163
snoyberg/conduit A framework for handling and transforming streaming data in a consistent and efficient way 903
nysol/mcmd A set of commands for high-speed processing of large-scale CSV data 33
iu-parfunc/lvars Provides a data structure and framework for monotonically-growing concurrent programs 80
jepst/cloudhaskell A distributed computing framework for building fault-tolerant and redundant applications 347
haskell-distributed/distributed-process An implementation of Cloud Haskell's core libraries for building distributed systems 713
simonmar/par-tutorial A tutorial and code samples for parallel and concurrent programming in Haskell. 219
apache/samza A distributed stream processing framework for handling high-volume data streams with fault tolerance and durability guarantees 817
hslam/raftdb A distributed key-value database system using the Raft consensus protocol 4
jondot/crunch A toolkit for extracting insights from large datasets by parsing and processing semi-structured data 214
anshuman73/deml-golem A proof-of-concept implementation of decentralized machine learning on top of the Golem architecture 43
abstractsdk/abstract A modular framework for building secure, composable, and interoperable on-chain applications 63