MixEval
The official evaluation suite and dynamic data release for MixEval.
203 stars
1 watching
29 forks
Language: Python
last commit: 17 days ago
Linked from 1 awesome list
benchmarkbenchmark-mixturebenchmarking-frameworkbenchmarking-suiteevaluationevaluation-frameworkfoundation-modelslarge-language-modellarge-language-modelslarge-multimodal-modelsllm-evaluationllm-evaluation-frameworkllm-inferencemixeval