M3CoT

Chain-of-Thought benchmark

A benchmarking framework for multi-modal Chain-of-Thought models that evaluates their performance on step-by-step reasoning tasks.

GitHub

47 stars
2 watching
2 forks
Language: Python
last commit: 8 months ago