MMC

Chart model trainer

Develops a large-scale dataset and benchmark for training multimodal chart understanding models using large language models.

[NAACL 2024] MMC: Advancing Multimodal Chart Understanding with LLM Instruction Tuning

GitHub

87 stars
6 watching
3 forks
Language: Python
last commit: 4 months ago
arxivbenchmarkchartdatasetgptinstruction-tuningllavaminigpt4mplug-owlmultimodalotterresourcestock

Related projects:

Repository Description Stars
felixgithub2017/mmcu Measures the understanding of massive multitask Chinese datasets using large language models 87
haozhezhao/mic Develops a multimodal vision-language model to enable machines to understand complex relationships between instructions and images in various tasks. 337
ailab-cvc/seed-bench A benchmark for evaluating large language models' ability to process multimodal input 322
open-mmlab/mmengine Provides a flexible and configurable framework for training deep learning models with PyTorch. 1,196
ailab-cvc/seed An implementation of a multimodal language model with capabilities for comprehension and generation 585
fukuball/fuku-ml An easy-to-use machine learning library with various algorithms for classification and regression tasks. 281
giuseppec/iml Provides methods to interpret and explain the behavior of machine learning models 494
pleisto/yuren-baichuan-7b A multi-modal large language model that integrates natural language and visual capabilities with fine-tuning for various tasks 73
yuliang-liu/monkey An end-to-end image captioning system that uses large multi-modal models and provides tools for training, inference, and demo usage. 1,849
minimaxir/automl-gs Automates machine learning model creation and optimization for complex datasets 1,857
yuweihao/mm-vet Evaluates the capabilities of large multimodal models using a set of diverse tasks and metrics 274
freedomintelligence/mllm-bench Evaluates and compares the performance of multimodal large language models on various tasks 56
mosecorg/mosec A high-performance ML model serving framework 802
tingxueronghua/chartllama-code A multimodal LLM for understanding and generating charts in various formats. 202
x-plug/mplug-halowl Evaluates and mitigates hallucinations in multimodal large language models 82