serve

AI service framework

A framework for building and deploying AI services that can be scaled from local development to production

☁️ Build multimodal AI applications with cloud-native stack

GitHub

21k stars
215 watching
2k forks
Language: Python
last commit: 3 months ago
cloud-nativecncfdeep-learningdockerfastapiframeworkgenerative-aigrpcjaegerkubernetesllmopsmachine-learningmicroservicemlopsmultimodalneural-searchopentelemetryorchestrationpipelineprometheus

Related projects:

Repository Description Stars
jina-ai/clip-as-service A service that provides fast and scalable image-text embeddings using CLIP models, supporting visual reasoning and integration with neural search ecosystems. 12,497
jina-ai/dalle-flow An interactive workflow for generating high-definition images from text prompts using a human-in-the-loop approach 2,837
fedml-ai/fedml A unified and scalable machine learning library for large-scale distributed training, model serving, and federated learning. 4,205
meta-llama/llama-stack Provides pre-packaged building blocks for generative AI applications with standardized APIs and service-oriented design. 5,164
bentoml/bentoml An open-source Python framework for building model inference APIs and serving AI models in production environments. 7,222
kong/kong A platform that provides a centralized layer for managing and orchestrating API traffic and microservices 39,568
tensorflow/serving A high-performance serving system for machine learning models in production environments. 6,195
gofireflyio/aiac A tool that generates Infrastructure as Code templates and configurations using large language models. 3,549
livekit/agents An open-source framework for building real-time, multimodal AI applications with flexible integrations and a focus on voice agents and conversational flow. 4,210
vercel/ai A toolkit for building AI-powered applications with various frameworks and model providers 10,554
google-ai-edge/mediapipe A platform providing pre-built machine learning models and APIs for cross-platform deployment on various devices 27,962
sinaptik-ai/pandas-ai Makes data analysis conversational using LLMs and natural language 13,714
significant-gravitas/autogpt A platform for building and deploying autonomous AI agents to automate complex workflows 169,186
mindsdb/mindsdb A platform for building AI agents that can learn from and answer questions across multiple data sources using machine learning and natural language processing. 26,915
portkey-ai/gateway A fast and secure routing service for integrating with large language models 6,557