inference

Model deployment hub

A platform for deploying and fine-tuning computer vision models in production-ready environments.

A fast, easy-to-use, production-ready inference server for computer vision supporting deployment of many popular model architectures and fine-tuned models.

GitHub

1k stars
23 watching
137 forks
Language: Python
last commit: about 1 month ago
Linked from 2 awesome lists

classificationcomputer-visiondeploymentdockerhacktoberfestinferenceinference-apiinference-serverinstance-segmentationjetsonmachine-learningobject-detectiononnxpythontensorrtvityolo11yolov5yolov7yolov8

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
ebhy/budgetml Simplifies deployment of machine learning models to production-ready endpoints with minimal configuration and cost. 1,341
balavenkatesh3322/model_deployment Provides tools and frameworks for deploying machine learning models in production environments 73
seldonio/mlserver An inference server for machine learning models with support for multiple frameworks and scalable deployment options. 737
mlcommons/inference Measures the performance of deep learning models in various deployment scenarios. 1,256
caraml-dev/merlin A platform for deploying and serving machine learning models in a scalable, cost-efficient, and easy-to-use manner 168
tpoisonooo/llama.onnx A project providing onnx models and tools for inference with LLaMa transformer model on various devices 356
nndeploy/nndeploy An end-to-end model deployment framework providing cross-platform simplicity and high performance 662
combust/mleap Enables deployment of machine learning pipelines from Spark and Scikit-Learn to production 1,506
utensor/utensor A lightweight machine learning inference framework built on Tensorflow optimized for Arm targets. 1,742
keras-team/keras-hub A unified interface to various deep learning architectures 818
hxyou/idealgpt A deep learning framework for iteratively decomposing vision and language reasoning via large language models. 32
deploykf/deploykf Builds machine learning platforms on Kubernetes by combining popular tools and services 381
hubfire/muti-branch-ddpg-carla An implementation of a reinforcement learning algorithm using multi-branch architecture and Deep Deterministic Policy Gradients (DDPG) to control autonomous vehicles in simulation environments. 81
patriciogonzalezvivo/prisma A software framework for performing multiple inferences from images or videos and exporting derived data for various applications. 240
roboflow/maestro A tool to streamline fine-tuning of multimodal models for vision-language tasks 1,415