agenta
LLM platform
An end-to-end platform for building and deploying large language model applications
The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM Observability all in one place.
2k stars
21 watching
213 forks
Language: Python
last commit: 2 months ago
Linked from 2 awesome lists
llm-as-a-judgellm-evaluationllm-frameworkllm-monitoringllm-observabilityllm-platformllm-playgroundllm-toolsllmops-platformprompt-engineeringprompt-managementrag-evaluation
Related projects:
Repository | Description | Stars |
---|---|---|
| Builds agents controlled by large language models (LLMs) to perform tasks with tool-based components | 940 |
| A collection of papers on the use of large language models in agent-based systems | 1,960 |
| A comprehensive toolset for building Large Language Model (LLM) based applications | 1,733 |
| A benchmark suite for evaluating the ability of large language models to operate as autonomous agents in various environments | 2,272 |
| A low-code development tool for building multi-agent large language models applications | 1,039 |
| A lightweight framework for building agent-based applications using LLMs and transformer architectures | 1,924 |
| A tool for easy interaction with Large Language Models (LLMs) to execute structured function calls and generate structured output. | 505 |
| A framework and benchmark for training and evaluating multi-modal large language models, enabling the development of AI agents capable of seamless interaction between humans and machines. | 305 |
| A tool for building and deploying generative AI applications with a no-code multi-agent framework | 1,659 |
| An open-source toolkit for building and evaluating large language models | 267 |
| An event-driven developer platform for building and running large language model AI apps | 398 |
| Automates large code generation and writing tasks using a large language model framework | 79 |
| Compiles and curates research papers on LLM-based agent architectures and applications | 1,127 |
| A benchmark for evaluating large language models in multiple languages and formats | 93 |
| A high-performance LLM written in Python/Jax for training and inference on Google Cloud TPUs and GPUs. | 1,557 |