MathVista

Math Reasoning Evaluation Platform

Evaluating mathematical reasoning in visual contexts using large language models and multimodal AI

MathVista: data, code, and evaluation for Mathematical Reasoning in Visual Contexts

GitHub

253 stars

6 watching

39 forks

Language: Jupyter Notebook

last commit: over 1 year ago

ai4math, large-language-models, large-multimadality-models, machine-learning, mathematics, mathqa, science, visual-question-answering


mathvista.github.io/

Related projects:

Repository	Description	Stars
mathllm/math-v	A dataset and code framework to evaluate the ability of Large Multimodal Models (LMMs) to reason mathematically with visual contexts.	74
lupantech/scienceqa	A dataset and software framework for building multimodal reasoning systems to answer science questions.	615
metadelta/mdlt	A command-line utility for performing arithmetic and symbolic math operations.	178
open-compass/vlmevalkit	An evaluation toolkit for large vision-language models	1,514
5anthosh/fcal	A math expression evaluator library that allows users to perform calculations with precision and various units, functions, and constants.	110
willem-j-an/visidata.nvim	Enables visualizing pandas dataframes in Neovim using Visidata	26
jy0205/lavit	A unified framework for training large language models to understand and generate visual content	544
nvlabs/relvit	A deep learning framework designed to improve visual reasoning capabilities by utilizing concepts and semantic relations.	64
jnhwkim/nips-mrn-vqa	A neural network model that answers visual questions by combining question and image features in a residual learning framework.	39
davidmascharka/tbd-nets	An open-source implementation of a deep learning model designed to improve the balance between performance and interpretability in visual reasoning tasks.	348
lupantech/chameleon-llm	Develops a framework to generate responses by composing various tools with large language models.	1,095
airaria/visual-chinese-llama-alpaca	Develops a multimodal Chinese language model with visual capabilities	429
lirios/calculator	A cross-platform calculator built with QML and Material Design	29
rucaibox/comvint	Creates synthetic visual reasoning instructions to improve the performance of large language models on image-related tasks.	18
lxtgh/omg-seg	Develops an end-to-end model for multiple visual perception and reasoning tasks using a single encoder, decoder, and large language model.	1,336