llama_deploy

Deployment framework

An async-first framework for deploying and scaling agentic multi-service systems

Deploy your agentic worfklows to production

GitHub

2k stars
25 watching
189 forks
Language: Python
last commit: 9 days ago
Linked from 1 awesome list

agentsdeploymentframeworkllamaindexllmmulti-agents

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
run-llama/llamaindexts A data framework for integrating large language models into applications with custom data 1,960
talkdai/dialog An application framework to simplify the deployment and testing of large language models (LLMs) for natural language processing tasks. 377
soulteary/docker-llama2-chat An implementation of LLaMA2 in Docker, allowing users to quickly deploy and run the model on their local machine. 536
balavenkatesh3322/model_deployment Provides tools and frameworks for deploying machine learning models in production environments 73
snunez1/llama.cl A Common Lisp port of a Large Language Model (LLM) implementation 35
maximilian-winter/llama-cpp-agent A tool for easy interaction with Large Language Models (LLMs) to execute structured function calls and generate structured output. 493
agenta-ai/agenta An end-to-end platform for building and deploying large language model applications 1,287
langchain-ai/langserve Provides a REST API for deploying and managing LangChain runnables and chains 1,944
opengvlab/lamm A framework and benchmark for training and evaluating multi-modal large language models, enabling the development of AI agents capable of seamless interaction between humans and machines. 301
googlecloudplatform/click-to-deploy A collection of pre-configured environments for deploying various Google Cloud-based applications 730
internlm/lagent A lightweight framework for building agent-based applications using LLMs and transformer architectures 1,865
farama-foundation/magent2 A library for creating high-performance environments for training large numbers of competing agents in multi-agent scenarios 229
accessd/slack-deploy-bot A Slack bot that automates the deployment process of web applications to various environments. 38
azure/aro-landing-zone-accelerator Provides an architectural approach and reference implementation to deploy workload platforms on Azure at scale 46
ai-hypercomputer/maxtext A high-performance LLM written in Python/Jax for training and inference on Google Cloud TPUs and GPUs. 1,529