agenta
LLM platform
An end-to-end platform for building and deploying large language model applications
The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability, all in one place.
2k stars
21 watching
213 forks
Language: Python
last commit: 11 months ago
Linked from 2 awesome lists
llm-as-a-judge, llm-evaluation, llm-framework, llm-monitoring, llm-observability, llm-platform, llm-playground, llm-tools, llmops-platform, prompt-engineering, prompt-management, rag-evaluation
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | Builds agents controlled by large language models (LLMs) to perform tasks with tool-based components | 940 |
| | A collection of papers on the use of large language models in agent-based systems | 1,960 |
| | A comprehensive toolset for building Large Language Model (LLM) based applications | 1,733 |
| | A benchmark suite for evaluating the ability of large language models to operate as autonomous agents in various environments | 2,272 |
| | A low-code development tool for building multi-agent large language model applications | 1,039 |
| | A lightweight framework for building agent-based applications using LLMs and transformer architectures | 1,924 |
| | A tool for easy interaction with Large Language Models (LLMs) to execute structured function calls and generate structured output | 505 |
| | A framework and benchmark for training and evaluating multi-modal large language models, enabling the development of AI agents capable of seamless interaction between humans and machines | 305 |
| | A tool for building and deploying generative AI applications with a no-code multi-agent framework | 1,659 |
| | An open-source toolkit for building and evaluating large language models | 267 |
| | An event-driven developer platform for building and running large language model AI apps | 398 |
| | Automates large code generation and writing tasks using a large language model framework | 79 |
| | Compiles and curates research papers on LLM-based agent architectures and applications | 1,127 |
| | A benchmark for evaluating large language models in multiple languages and formats | 93 |
| | A high-performance LLM written in Python/JAX for training and inference on Google Cloud TPUs and GPUs | 1,557 |