datacollector-oss

Data pipeline tool

A continuous big data ingestion platform that enables easy creation of data pipelines for various data sources and destinations.

datacollector-oss

GitHub

90 stars
10 watching
99 forks
Language: Java
last commit: 4 months ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
apache/streampipes A toolbox for industrial data analytics and stream processing 607
ypares/porcupine A tool that enables data manipulation and analysis pipelines to be flexible, reusable, and reproducible in different environments 89
moby/datakit A tool to orchestrate applications using a version-controlled dataflow 1,082
nessos/streams A lightweight library for building efficient data pipelines using functional programming concepts 383
ibmstreams/streamsx.topology A collection of tools and templates for building streaming applications on IBM Streams using various programming languages. 29
nytlabs/streamtools A toolkit for manipulating and analyzing streams of data in a graphical manner 1,312
dagster-io/dagster An orchestration platform for data pipelines and assets, providing a declarative programming model and integrated lineage and observability. 11,754
datasalt/pangool A Java framework that simplifies Hadoop's MapReduce API to build efficient data processing pipelines 57
joboccara/pipes A header-only C++14 library for building expressive data pipelines using a chainable interface. 803
cyrusstoller/list-of-lists A collection of curated lists of software tools and resources for developers 32
apache/datasketches-java A software library of stochastic streaming algorithms, providing efficient data processing and analysis tools 896
yoshuawuyts/normcore A JavaScript library that enables the creation of stable, decentralized data streams using hypercore 28
netflix/suro A distributed data pipeline service for collecting, aggregating, and dispatching large volumes of application events. 794
knowledgeonwebscale/streamingmassif A Java-based platform for efficient processing of data streams by performing cascading reasoning and complex event processing. 9
apache/rocketmq-connect A tool for streaming data between Apache RocketMQ and other systems 122