MathVista

Math Reasoning Evaluation Platform

Evaluating mathematical reasoning in visual contexts using large language models and multimodal AI

MathVista: data, code, and evaluation for Mathematical Reasoning in Visual Contexts

GitHub

253 stars
6 watching
39 forks
Language: Jupyter Notebook
Last commit: about 2 months ago
ai4math, large-language-models, large-multimadality-models, machine-learning, mathematics, mathqa, science, visual-question-answering

Related projects:

Repository | Description | Stars
mathllm/math-v | A dataset and code framework to evaluate the ability of Large Multimodal Models (LMMs) to reason mathematically with visual contexts. | 74
lupantech/scienceqa | A dataset and software framework for building multimodal reasoning systems to answer science questions. | 615
metadelta/mdlt | A command-line utility for performing arithmetic and symbolic math operations. | 178
open-compass/vlmevalkit | An evaluation toolkit for large vision-language models. | 1,514
5anthosh/fcal | A math expression evaluator library that allows users to perform calculations with precision and various units, functions, and constants. | 110
willem-j-an/visidata.nvim | Enables visualizing pandas dataframes in Neovim using Visidata. | 26
jy0205/lavit | A unified framework for training large language models to understand and generate visual content. | 544
nvlabs/relvit | A deep learning framework designed to improve visual reasoning capabilities by utilizing concepts and semantic relations. | 64
jnhwkim/nips-mrn-vqa | This project presents a neural network model designed to answer visual questions by combining question and image features in a residual learning framework. | 39
davidmascharka/tbd-nets | An open-source implementation of a deep learning model designed to improve the balance between performance and interpretability in visual reasoning tasks. | 348
lupantech/chameleon-llm | Develops a framework to generate responses by composing various tools with large language models. | 1,095
airaria/visual-chinese-llama-alpaca | Develops a multimodal Chinese language model with visual capabilities. | 429
lirios/calculator | A cross-platform calculator built with QML and Material Design. | 29
rucaibox/comvint | Creating synthetic visual reasoning instructions to improve the performance of large language models on image-related tasks. | 18
lxtgh/omg-seg | Develops an end-to-end model for multiple visual perception and reasoning tasks using a single encoder, decoder, and large language model. | 1,336