poyro

LLM Tester

An extension of Vitest for testing LLM applications using local language models

Test your web app's LLM integrations using existing testing frameworks, and confidently launch AI-driven web apps to production.
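
The idea of asserting on LLM output inside an ordinary test can be sketched roughly as follows. This is only an illustration under stated assumptions: `Judge`, `keywordJudge`, and `assertFulfills` are hypothetical names, not LLM Tester's actual API, and a simple keyword check stands in for the local language model that would normally grade the output.

```typescript
// Hypothetical sketch: none of these names come from LLM Tester itself.

// A judge decides whether some text satisfies a natural-language criterion.
// In a real setup this would query a locally running language model
// (and would most likely be asynchronous).
type Judge = (text: string, criterion: string) => boolean;

// Stand-in judge for illustration: every word of the criterion must
// appear in the text (case-insensitive). No model is involved.
const keywordJudge: Judge = (text, criterion) => {
  const words = criterion.toLowerCase().split(/\s+/).filter(Boolean);
  const haystack = text.toLowerCase();
  return words.every((w) => haystack.includes(w));
};

// Assertion helper in the spirit of a test-framework matcher:
// throws when the judge reports that the criterion is not met.
function assertFulfills(
  text: string,
  criterion: string,
  judge: Judge = keywordJudge,
): void {
  if (!judge(text, criterion)) {
    throw new Error(`Expected output to fulfill criterion: "${criterion}"`);
  }
}
```

In a test, the string returned by the application's LLM call would be passed to `assertFulfills` along with the criterion it should meet; swapping `keywordJudge` for a model-backed judge is what would make the check semantic rather than literal.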

GitHub

31 stars
2 watching
1 fork
Language: TypeScript
Last commit: about 1 year ago
Linked from 1 awesome list

Tags: ai, evaluation, llm, llmops, nodejs, prompt, prompt-engineering, prompts, testing, vitest

Related projects:

Repository | Description | Stars
comet-ml/opik | A platform for evaluating and testing large language models (LLMs) during development and production. | 2,588
innogames/ltc | A tool for managing load tests and analyzing performance results | 200
quolpr/quicktest.nvim | A plugin for running and testing code in multiple programming languages | 83
lingaro/azure-locust | A tool for running distributed load tests on Azure Container Instances | 30
oblador/loki | Automated testing tool for React applications to ensure visual consistency and accuracy | 1,795
helicone/helicone | An all-in-one platform for monitoring and managing large language models | 2,163
llm-ui-kit/llm-ui | A React library designed to work with Large Language Models (LLMs) by providing features such as syntax removal, custom component addition, and rendering at a native frame rate. | 425
norman/telescope | A test library for Lua that supports declarative testing with nested contexts and code coverage reports. | 161
victordibia/llmx | An API that provides a unified interface to multiple large language models for chat fine-tuning | 79
h2oai/h2o-llm-eval | An evaluation framework for large language models with Elo rating system and A/B testing capabilities | 50
pylons/webtest | Allows testing of WSGI applications without setting up an HTTP server | 337
vicampo/riposte | A scripting language and toolset for testing JSON-based HTTP APIs | 45
johnsnowlabs/langtest | A tool for testing and evaluating large language models with a focus on AI safety and model assessment. | 506
crewdevio/merlin | A testing framework for deno that provides a set of matchers to assert conditions in tests. | 50
llyx97/tempcompass | A tool to evaluate video language models' ability to understand and describe video content | 91