capillaries
Data processor
A distributed batch data processing framework that enables scalable and reliable data transformation, filtering, and aggregation.
Distributed batch data processing framework
62 stars
0 watching
2 forks
Language: Go
last commit: about 1 year ago
Linked from 1 awesome list
batch-processingcassandradagdistributed-computingdistributed-systemsgogolangrabbitmqrelational-algebraworkflow-engineworkflows
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | A Python library that provides a data processing and querying framework using the Apache Arrow in-memory query engine. | 385 |
| | A framework for organizing and processing multimedia data in a modular and flexible way | 1 |
| | A Python framework for real-time data processing on Apache Kafka streams | 1,246 |
| | A library for creating data workflows that can be simple or complex, with features like recursion and memoization. | 159 |
| | A package of tools and functions for processing and analyzing atmospheric model output and observational data. | 14 |
| | A PHP implementation of functional programming concepts to simplify data processing and analysis. | 245 |
| | A workflow engine for unifying feature engineering and machine learning operations in data analysis pipelines | 1 |
| | A high-performance platform for large-scale R data processing and analytics | 163 |
| | A framework for handling and transforming streaming data in a consistent and efficient way | 903 |
| | A workflow management system designed to efficiently run pipelines in various environments. | 901 |
| | A collection of utilities and examples for processing RDF data using various big-data technologies. | 24 |
| | A Python package for manipulating 2-dimensional tabular data structures with an emphasis on speed and big data support. | 1,821 |
| | A distributed stream processing framework for real-time data reactions | 1,477 |
| | A JSON query processor with a custom syntax that simplifies complex queries by breaking them down into step-by-step operations. | 895 |
| | A data engineering project that extracts insights from Python projects using DuckDB and MotherDuck. | 173 |