M3CoT
Chain-of-Thought benchmark
A benchmarking framework for multi-modal Chain-of-Thought models that evaluates their performance on step-by-step reasoning tasks.
47 stars
2 watching
2 forks
Language: Python
last commit: 8 months ago