BentoML

The easiest way to serve AI apps and models - Build reliable Inference APIs, LLM apps, Multi-model chains, RAG service, and much more!

GitHub

7k stars
78 watching
779 forks
Language: Python
last commit: 12 days ago
Linked from 3 awesome lists

ai-inferencedeep-learninggenerative-aiinference-platformllmllm-inferencellm-servingllmopsmachine-learningml-engineeringmlopsmodel-inference-servicemodel-servingmultimodalpython

Backlinks from these awesome lists: