disco 
 Big-data processor
 A distributed computing framework for parallel processing of large data sets
a Map/Reduce framework for distributed computing
2k stars
 85 watching
 241 forks
 
Language: Erlang 
last commit: almost 8 years ago 
Linked from   1 awesome list  
 Related projects:
| Repository | Description | Stars | 
|---|---|---|
|    |  A collection of utilities and examples for processing RDF data using various big-data technologies. | 24 | 
|    |  A high-performance platform for large-scale R data processing and analytics | 163 | 
|    |  A Python framework for parallelizing data transformations and processing | 1,589 | 
|    |  A distributed stream processing framework for handling high-volume data streams with fault tolerance and durability guarantees | 817 | 
|    |  A dataset manipulation library built on top of tech.ml.dataset, providing a simplified API for data processing and analysis. | 308 | 
|    |  A lightweight toolkit for processing tabular data with a focus on functional programming and PyPy compatibility. | 417 | 
|    |  A high-performance library for real-time data processing and time series manipulation | 430 | 
|    |  A protocol defining data exchange formats for a specific relational database system. | 1 | 
|    |  An orchestrator for DuckDB databases that automates data transformation and integration with other tools. | 182 | 
|    |  A toolkit for manipulating tabular ASCII data with normal UNIX tools. | 161 | 
|    |  A software project focused on developing a system for managing and processing complex data flows. | 52 | 
|    |  A framework for building and evolving complex business systems using Domain Driven Design principles | 1,127 | 
|    |  A set of functional programming abstractions and data structures for Erlang | 124 | 
|    |  A suite of command-line programs for manipulating and analyzing scientific data stored in netCDF formats | 174 | 
|    |  A set of commands for high-speed processing of large-scale CSV data | 33 |