MLLM-Bench

MLLM-Bench: Evaluating Multimodal LLMs with Per-sample Criteria

GitHub

50 stars
10 watching
3 forks
Language: Python
last commit: about 2 months ago