agenta

LLM platform

An end-to-end platform for building and deploying large language model applications

The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM Observability all in one place.

GitHub

1k stars
20 watching
188 forks
Language: Python
last commit: about 19 hours ago
Linked from 2 awesome lists

llm-as-a-judgellm-evaluationllm-frameworkllm-monitoringllm-observabilityllm-platformllm-playgroundllm-toolsllmops-platformprompt-engineeringprompt-managementrag-evaluation

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
mpaepper/llm_agents Builds agents controlled by large language models (LLMs) to perform tasks with tool-based components 931
zjunlp/llmagentpapers A collection of papers on the use of large language models in agent-based systems 1,852
melih-unsal/demogpt A comprehensive toolset for building Large Language Model (LLM) based applications 1,710
thudm/agentbench A benchmark suite for evaluating the ability of large language models to operate as autonomous agents in various environments 2,222
lazyagi/lazyllm A low-code development tool for building multi-agent large language models applications 1,020
internlm/lagent A lightweight framework for building agent-based applications using LLMs and transformer architectures 1,865
maximilian-winter/llama-cpp-agent A tool for easy interaction with Large Language Models (LLMs) to execute structured function calls and generate structured output. 493
opengvlab/lamm A framework and benchmark for training and evaluating multi-modal large language models, enabling the development of AI agents capable of seamless interaction between humans and machines. 301
trypromptly/llmstack A tool for building and deploying generative AI applications with a no-code multi-agent framework 1,610
aiplanethub/beyondllm An open-source toolkit for building and evaluating large language models 263
langstream/langstream An event-driven developer platform for building and running large language model AI apps 393
samholt/l2mac Automates large code generation and writing tasks using a large language model framework 70
agi-edgerunners/llm-agents-papers Compiles and curates research papers on LLM-based agent architectures and applications 1,092
damo-nlp-sg/m3exam A benchmark for evaluating large language models in multiple languages and formats 92
ai-hypercomputer/maxtext A high-performance LLM written in Python/Jax for training and inference on Google Cloud TPUs and GPUs. 1,529