alluxio

Storage hub

A distributed storage system for bridging the gap between computation frameworks and storage systems.

Alluxio, data orchestration for analytics and machine learning in the cloud

GitHub

7k stars
443 watching
3k forks
Language: Java
last commit: about 2 months ago
Linked from 4 awesome lists

alluxiodata-analysisdata-orchestrationhadoopmemory-speedprestosparktensorflowvirtual-distributed-filesystem

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
exo-explore/exo An experimental software framework to run AI models on diverse devices without requiring expensive GPUs. 17,369
nuclio/nuclio A high-performance platform for real-time event and data processing 5,339
istio/istio An open source service mesh that integrates and secures microservices in a distributed application 36,240
pachyderm/pachyderm Automates data transformations with versioning and lineage tracking for scalable data pipelines 6,191
karmada-io/karmada An orchestration system that allows running cloud-native applications across multiple Kubernetes clusters and clouds with minimal changes to the applications. 4,528
vmware-tanzu/velero Tools for backing up and restoring Kubernetes cluster resources and persistent volumes 8,846
zabbix/zabbix An enterprise-class monitoring solution designed to track performance and availability of IT resources and services in real-time. 4,484
kedro-org/kedro A toolbox for production-ready data science pipelines with software engineering best practices for reproducibility and modularity 10,050
datashaman/putio-automator Automates tasks related to managing torrents, transfers, and files on Put.IO 70
opentofu/opentofu An OSS tool for safely and efficiently managing cloud infrastructure using a high-level configuration syntax 23,594
istio-ecosystem/admiral Automates configuration and service discovery for multicluster Istio service mesh 593
dashbitco/broadway A concurrent and multi-stage data ingestion and processing framework in Elixir 2,447
abrander/agento Collects near real-time metrics from Linux hosts using InfluxDB as the backend. 28
datahub-project/datahub A platform for managing and discovering data across an organization's data stack 10,046
multiprocessio/datastation An all-in-one application for querying, scripting, and visualizing data from various sources 2,907