M3CoT

Chain-of-Thought benchmark

A benchmarking framework for multi-modal Chain-of-Thought models that evaluates their performance on step-by-step reasoning tasks.

47 stars

2 watching

2 forks

Language: Python

last commit: about 2 years ago