kubedl

Workload manager

Enables efficient deep learning workload deployment on Kubernetes

Run your deep learning workloads on Kubernetes more easily and efficiently.

GitHub

513 stars
22 watching
79 forks
Language: Go
Last commit: 10 months ago
Linked from 1 awesome list

container, deep-learning, inference, kubernetes, machine-learning, model, scheduling

Related projects:

| Repository | Description | Stars |
| --- | --- | --- |
| kubeflow/katib | An automated machine learning system that supports hyperparameter tuning and neural architecture search on Kubernetes. | 1,521 |
| sysdiglabs/kubectl-dig | Provides deep Kubernetes visibility from the kubectl by digging into cluster resources and metadata. | 383 |
| containerd/runwasi | Facilitates running WebAssembly workloads on a container runtime. | 1,114 |
| kubeflow/arena | A CLI tool for managing machine learning training jobs and monitoring GPU resources in Kubernetes clusters. | 743 |
| federatedai/kubefate | A tool for managing federated learning workloads using cloud-native technologies like Kubernetes and Docker Compose. | 423 |
| corneliusweig/ketall | A tool to list all Kubernetes resources in a cluster. | 687 |
| joerdav/xc | Solves the problem of scripts and documentation being separate entities, by defining them inline with the documentation in a human-readable format. | 1,206 |
| yanglei99/ycsb_mesos | Represents a setup to run the YCSB workload on various data stores using Mesos and Docker containers. | 1 |
| kubehelper/kubehelper | Simplifies daily Kubernetes tasks through a web interface. | 246 |
| steebchen/kubectl | A GitHub Action that integrates Kubernetes CLI functionality into Continuous Integration/Continuous Deployment pipelines. | 215 |
| therfoo/therfoo | An embedded deep learning library for Go. | 18 |
| guessi/kubectl-grep | A plugin for Kubernetes' `kubectl` command to filter resources by matching their names. | 114 |
| kube-burner/kube-burner | A toolset for testing and optimizing Kubernetes clusters' performance and scale. | 508 |
| dirkaholic/kyoo | An unlimited job queue with concurrent worker pools for Go applications. | 51 |
| elotl/kip | A tool that allows a Kubernetes cluster to run pods on its own cloud instances, automating node management and scaling. | 225 |