disco
Big-data processor
A distributed computing framework for parallel processing of large data sets
a Map/Reduce framework for distributed computing
2k stars
85 watching
241 forks
Language: Erlang
last commit: about 7 years ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
| A collection of utilities and examples for processing RDF data using various big-data technologies. | 24 |
| A high-performance platform for large-scale R data processing and analytics | 163 |
| A Python framework for parallelizing data transformations and processing | 1,589 |
| A distributed stream processing framework for handling high-volume data streams with fault tolerance and durability guarantees | 817 |
| A dataset manipulation library built on top of tech.ml.dataset, providing a simplified API for data processing and analysis. | 308 |
| A lightweight toolkit for processing tabular data with a focus on functional programming and PyPy compatibility. | 417 |
| A high-performance library for real-time data processing and time series manipulation | 430 |
| A protocol defining data exchange formats for a specific relational database system. | 1 |
| An orchestrator for DuckDB databases that automates data transformation and integration with other tools. | 182 |
| A toolkit for manipulating tabular ASCII data with normal UNIX tools. | 161 |
| A software project focused on developing a system for managing and processing complex data flows. | 52 |
| A framework for building and evolving complex business systems using Domain Driven Design principles | 1,127 |
| A set of functional programming abstractions and data structures for Erlang | 124 |
| A suite of command-line programs for manipulating and analyzing scientific data stored in netCDF formats | 174 |
| A set of commands for high-speed processing of large-scale CSV data | 33 |