serve

AI service framework

A framework for building and deploying AI services that can be scaled from local development to production

☁️ Build multimodal AI applications with cloud-native stack

GitHub

21k stars
213 watching
2k forks
Language: Python
last commit: 9 days ago
cloud-nativecncfdeep-learningdockerfastapiframeworkgenerative-aigrpcjaegerkubernetesllmopsmachine-learningmicroservicemlopsmultimodalneural-searchopentelemetryorchestrationpipelineprometheus

Related projects:

Repository Description Stars
jina-ai/clip-as-service A service that provides fast and scalable image-text embeddings using CLIP models, supporting visual reasoning and integration with neural search ecosystems. 12,455
jina-ai/dalle-flow An interactive workflow for generating high-definition images from text prompts using a human-in-the-loop approach 2,834
fedml-ai/fedml A unified and scalable machine learning library for large-scale distributed training, model serving, and federated learning. 4,187
meta-llama/llama-stack Provides a set of standardized APIs and tools to build generative AI applications 4,591
bentoml/bentoml An open-source Python framework for building model inference APIs and serving AI models in production environments. 7,153
kong/kong A platform that provides a centralized layer for managing and orchestrating API traffic and microservices 39,308
tensorflow/serving A high-performance serving system for machine learning models in production environments. 6,185
gofireflyio/aiac A tool that generates Infrastructure as Code templates and configurations using large language models. 3,528
livekit/agents A framework for building real-time AI applications that can perceive and respond to user input through multiple media channels. 3,990
vercel/ai A toolkit for building AI-powered applications with various frameworks and model providers 10,114
google-ai-edge/mediapipe A platform providing pre-built machine learning models and APIs for cross-platform deployment on various devices 27,608
sinaptik-ai/pandas-ai Makes data analysis conversational using LLMs and natural language 13,516
significant-gravitas/autogpt A platform for building and deploying autonomous AI agents to automate complex workflows 168,407
mindsdb/mindsdb An AI platform for building agents that can learn and answer questions over federated data from various sources. 26,793
portkey-ai/gateway A fast and reliable AI routing service with built-in guardrails for generating requests to multiple large language models. 6,290