opik
LLM testing platform
A platform for evaluating and testing large language models (LLMs) during development and production.
Open-source end-to-end LLM Development Platform
3k stars
38 watching
158 forks
Language: Java
last commit: 2 months ago
Linked from 9 awesome lists
Backlinks from these awesome lists:
- josephmisiti/awesome-machine-learning
- academic/awesome-datascience
- hannibal046/awesome-llm
- ethicalml/awesome-production-machine-learning
- sindresorhus/awesome-chatgpt
- kelvins/awesome-mlops
- promptslab/awesome-prompt-engineering
- jphall663/awesome-machine-learning-interpretability
- agamm/awesome-developer-first
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | An extension of Vitest for testing LLM applications using local language models | 31 |
| | A tool for managing load tests and analyzing performance results | 200 |
| | A set of libraries and tools for efficient message passing and data marshalling in real-time systems | 1,011 |
| | An effort to develop and compare large language models beyond OpenGPT | 105 |
| | A tool for testing and evaluating large language models with a focus on AI safety and model assessment | 506 |
| | An open-source implementation of a vision-language instructed large language model | 513 |
| | A test library for Lua that supports declarative testing with nested contexts and code coverage reports | 161 |
| | A benchmarking framework for large language models | 81 |
| | A web-based e-learning platform with features like assessment, content management, and learning resources, built using Java | 337 |
| | A React library designed to work with large language models (LLMs) by providing features such as syntax removal, custom component addition, and rendering at a native frame rate | 425 |
| | A benchmark for evaluating large language models' ability to process multimodal input | 322 |
| | A comprehensive toolset for building large language model (LLM) based applications | 1,733 |
| | An application framework to simplify the deployment and testing of large language models (LLMs) for natural language processing tasks | 380 |
| | A set of tools and components to simplify the integration of large language models into Dart/Flutter applications | 441 |
| | A tool to automate the evaluation of large language models in Google Colab using various benchmarks and custom parameters | 566 |