datacollector-oss
Data pipeline tool
A continuous big data ingestion platform that enables easy creation of data pipelines for various data sources and destinations.
datacollector-oss
90 stars
10 watching
99 forks
Language: Java
last commit: 4 months ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
apache/streampipes | A toolbox for industrial data analytics and stream processing | 607 |
ypares/porcupine | A tool that enables data manipulation and analysis pipelines to be flexible, reusable, and reproducible in different environments | 89 |
moby/datakit | A tool to orchestrate applications using a version-controlled dataflow | 1,082 |
nessos/streams | A lightweight library for building efficient data pipelines using functional programming concepts | 383 |
ibmstreams/streamsx.topology | A collection of tools and templates for building streaming applications on IBM Streams using various programming languages. | 29 |
nytlabs/streamtools | A toolkit for manipulating and analyzing streams of data in a graphical manner | 1,312 |
dagster-io/dagster | An orchestration platform for data pipelines and assets, providing a declarative programming model and integrated lineage and observability. | 11,754 |
datasalt/pangool | A Java framework that simplifies Hadoop's MapReduce API to build efficient data processing pipelines | 57 |
joboccara/pipes | A header-only C++14 library for building expressive data pipelines using a chainable interface. | 803 |
cyrusstoller/list-of-lists | A collection of curated lists of software tools and resources for developers | 32 |
apache/datasketches-java | A software library of stochastic streaming algorithms, providing efficient data processing and analysis tools | 896 |
yoshuawuyts/normcore | A JavaScript library that enables the creation of stable, decentralized data streams using hypercore | 28 |
netflix/suro | A distributed data pipeline service for collecting, aggregating, and dispatching large volumes of application events. | 794 |
knowledgeonwebscale/streamingmassif | A Java-based platform for efficient processing of data streams by performing cascading reasoning and complex event processing. | 9 |
apache/rocketmq-connect | A tool for streaming data between Apache RocketMQ and other systems | 122 |