MixEval

The official evaluation suite and dynamic data release for MixEval.

GitHub

203 stars
1 watching
29 forks
Language: Python
last commit: 17 days ago
Linked from 1 awesome list

benchmarkbenchmark-mixturebenchmarking-frameworkbenchmarking-suiteevaluationevaluation-frameworkfoundation-modelslarge-language-modellarge-language-modelslarge-multimodal-modelsllm-evaluationllm-evaluation-frameworkllm-inferencemixeval

Backlinks from these awesome lists: