disco

Big-data processor

A distributed computing framework for parallel processing of large data sets

a Map/Reduce framework for distributed computing

GitHub

2k stars
85 watching
241 forks
Language: Erlang
last commit: almost 7 years ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
castagna/jena-grande A collection of utilities and examples for processing RDF data using various big-data technologies. 24
vertica/distributedr A high-performance platform for large-scale R data processing and analytics 163
python-bonobo/bonobo A Python framework for parallelizing data transformations and processing 1,589
apache/samza A distributed stream processing framework for handling high-volume data streams with fault tolerance and durability guarantees 820
scicloj/tablecloth A dataset manipulation library built on top of tech.ml.dataset, providing a simplified API for data processing and analysis. 303
reubano/meza A lightweight toolkit for processing tabular data with a focus on functional programming and PyPy compatibility. 416
spreads/spreads A high-performance library for real-time data processing and time series manipulation 427
dalmatinerdb/dproto A protocol defining data exchange formats for a specific relational database system. 1
bl3f/yato An orchestrator for DuckDB databases that automates data transformation and integration with other tools. 174
dkogan/vnlog A toolkit for manipulating tabular ASCII data with normal UNIX tools. 160
ileriayo/ileriayo A software project focused on developing a system for managing and processing complex data flows. 52
funkygao/cp-ddd-framework A framework for building and evolving complex business systems using Domain Driven Design principles 1,123
fogfish/datum A set of functional programming abstractions and data structures for Erlang 124
nco/nco A suite of command-line programs for manipulating and analyzing scientific data stored in netCDF formats 173
nysol/mcmd A set of commands for high-speed processing of large-scale CSV data 33