inference

Model deployment hub

A platform for deploying and fine-tuning computer vision models in production-ready environments.

A fast, easy-to-use, production-ready inference server for computer vision supporting deployment of many popular model architectures and fine-tuned models.

GitHub

1k stars
23 watching
130 forks
Language: Python
last commit: about 15 hours ago
Linked from 2 awesome lists

classificationcomputer-visiondeploymentdockerhacktoberfestinferenceinference-apiinference-serverinstance-segmentationjetsonmachine-learningobject-detectiononnxpythontensorrtvityolo11yolov5yolov7yolov8

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
ebhy/budgetml Simplifies deployment of machine learning models to production-ready endpoints with minimal configuration and cost. 1,338
balavenkatesh3322/model_deployment Provides tools and frameworks for deploying machine learning models in production environments 73
seldonio/mlserver An inference server for machine learning models with support for multiple frameworks and scalable deployment options. 720
mlcommons/inference Measures the performance of deep learning models in various deployment scenarios. 1,236
caraml-dev/merlin A platform for deploying and serving machine learning models in a scalable, cost-efficient, and easy-to-use manner 167
tpoisonooo/llama.onnx A project providing onnx models and tools for inference with LLaMa transformer model on various devices 352
nndeploy/nndeploy An end-to-end model deployment framework providing cross-platform simplicity and high performance 632
combust/mleap Enables deployment of machine learning data pipelines and algorithms to production 1,504
utensor/utensor A lightweight machine learning inference framework built on Tensorflow optimized for Arm targets. 1,729
keras-team/keras-hub Provides pre-trained models and building blocks for natural language processing, computer vision, audio, and multimodal tasks 797
hxyou/idealgpt A deep learning framework for iteratively decomposing vision and language reasoning via large language models. 32
deploykf/deploykf Builds machine learning platforms on Kubernetes by combining popular tools and services 376
hubfire/muti-branch-ddpg-carla An implementation of a reinforcement learning algorithm using multi-branch architecture and Deep Deterministic Policy Gradients (DDPG) to control autonomous vehicles in simulation environments. 79
patriciogonzalezvivo/prisma A software framework for performing multiple inferences from images or videos and exporting derived data for various applications. 222
roboflow/maestro A tool to streamline fine-tuning of multimodal models for vision-language tasks 1,392