evals
Evals is a framework for evaluating OpenAI models and an open-source registry of benchmarks.
18 stars
0 watching
3 forks
Language: Python
last commit: over 1 year ago
Linked from 1 awesome list