llama_deploy
Deployment framework
An async-first framework for deploying and scaling agentic multi-service systems
Deploy your agentic workflows to production
2k stars
25 watching
189 forks
Language: Python
Last commit: 9 days ago
Linked from 1 awesome list
agents, deployment, framework, llamaindex, llm, multi-agents
Related projects:
Repository | Description | Stars |
---|---|---|
run-llama/llamaindexts | A data framework for integrating large language models into applications with custom data | 1,960 |
talkdai/dialog | An application framework to simplify the deployment and testing of large language models (LLMs) for natural language processing tasks. | 377 |
soulteary/docker-llama2-chat | An implementation of LLaMA2 in Docker, allowing users to quickly deploy and run the model on their local machine. | 536 |
balavenkatesh3322/model_deployment | Provides tools and frameworks for deploying machine learning models in production environments | 73 |
snunez1/llama.cl | A Common Lisp port of a Large Language Model (LLM) implementation | 35 |
maximilian-winter/llama-cpp-agent | A tool for easy interaction with Large Language Models (LLMs) to execute structured function calls and generate structured output. | 493 |
agenta-ai/agenta | An end-to-end platform for building and deploying large language model applications | 1,287 |
langchain-ai/langserve | Provides a REST API for deploying and managing LangChain runnables and chains | 1,944 |
opengvlab/lamm | A framework and benchmark for training and evaluating multi-modal large language models, enabling the development of AI agents capable of seamless interaction between humans and machines. | 301 |
googlecloudplatform/click-to-deploy | A collection of pre-configured environments for deploying various Google Cloud-based applications | 730 |
internlm/lagent | A lightweight framework for building agent-based applications using LLMs and transformer architectures | 1,865 |
farama-foundation/magent2 | A library for creating high-performance environments for training large numbers of competing agents in multi-agent scenarios | 229 |
accessd/slack-deploy-bot | A Slack bot that automates the deployment process of web applications to various environments. | 38 |
azure/aro-landing-zone-accelerator | Provides an architectural approach and reference implementation to deploy workload platforms on Azure at scale | 46 |
ai-hypercomputer/maxtext | A high-performance LLM written in Python/JAX for training and inference on Google Cloud TPUs and GPUs. | 1,529 |