pangool
Data pipeline builder
A Java framework that simplifies Hadoop's MapReduce API to build efficient data processing pipelines
Tuple MapReduce for Hadoop: Hadoop API made easy
57 stars
12 watching
13 forks
Language: Java
last commit: over 2 years ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
| A TypeScript library that enables the creation of modular, composable, and reusable data processing pipelines | 25 |
| A system that enables flexible data processing pipelines using a low-level engine for higher-level frameworks | 482 |
| A collection of libraries for working with large-scale data in Hadoop, providing incremental processing capabilities and user-defined functions. | 583 |
| A tool for creating flexible and self-documenting data science pipelines | 56 |
| A tool that enables data manipulation and analysis pipelines to be flexible, reusable, and reproducible in different environments | 89 |
| A command-line tool for automating data processing and uploads from Planet's API to Google Earth Engine. | 42 |
| A distributed data pipeline service for collecting, aggregating, and dispatching large volumes of application events. | 794 |
| A distributed system for streaming data between heterogeneous systems with high reliability and throughput at scale | 931 |
| Simplifies the deployment of Kubeflow Pipelines workflows by providing a graphical interface for Data Scientists to define and deploy pipelines directly from JupyterLab. | 632 |
| A toolbox for industrial data analytics and stream processing | 614 |
| A collection of Java implementations of various data structures and algorithms used in computer science | 146 |
| A library for composing and chaining functions on Observables in RxJava to simplify complex data processing pipelines. | 49 |
| Automates end-to-end machine learning pipeline deployment with AWS services | 111 |
| A Clojure-based library for writing efficient MapReduce programs on the Hadoop platform | 257 |
| Provides a set of pre-defined data processing pipelines for pandas DataFrames. | 718 |