disco
Big-data processor
A distributed computing framework for parallel processing of large data sets
a Map/Reduce framework for distributed computing
2k stars
85 watching
241 forks
Language: Erlang
last commit: almost 7 years ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
castagna/jena-grande | A collection of utilities and examples for processing RDF data using various big-data technologies. | 24 |
vertica/distributedr | A high-performance platform for large-scale R data processing and analytics | 163 |
python-bonobo/bonobo | A Python framework for parallelizing data transformations and processing | 1,589 |
apache/samza | A distributed stream processing framework for handling high-volume data streams with fault tolerance and durability guarantees | 820 |
scicloj/tablecloth | A dataset manipulation library built on top of tech.ml.dataset, providing a simplified API for data processing and analysis. | 303 |
reubano/meza | A lightweight toolkit for processing tabular data with a focus on functional programming and PyPy compatibility. | 416 |
spreads/spreads | A high-performance library for real-time data processing and time series manipulation | 427 |
dalmatinerdb/dproto | A protocol defining data exchange formats for a specific relational database system. | 1 |
bl3f/yato | An orchestrator for DuckDB databases that automates data transformation and integration with other tools. | 174 |
dkogan/vnlog | A toolkit for manipulating tabular ASCII data with normal UNIX tools. | 160 |
ileriayo/ileriayo | A software project focused on developing a system for managing and processing complex data flows. | 52 |
funkygao/cp-ddd-framework | A framework for building and evolving complex business systems using Domain Driven Design principles | 1,123 |
fogfish/datum | A set of functional programming abstractions and data structures for Erlang | 124 |
nco/nco | A suite of command-line programs for manipulating and analyzing scientific data stored in netCDF formats | 173 |
nysol/mcmd | A set of commands for high-speed processing of large-scale CSV data | 33 |