evals

Evals is a framework for evaluating OpenAI models and an open-source registry of benchmarks.

GitHub

18 stars
0 watching
3 forks
Language: Python
last commit: over 1 year ago
Linked from 1 awesome list


Backlinks from these awesome lists: