ChartMimic
chart evaluation tool
An open-source benchmarking project that evaluates large multimodal models' code generation capabilities via visually-grounded chart-to-code conversion
ChartMimic: Evaluating LMM’s Cross-Modal Reasoning Capability via Chart-to-Code Generation
95 stars
6 watching
0 forks
Language: Python
last commit: almost 2 years ago artificial-intelligencecode-generationmultimodal
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | A multimodal LLM for understanding and generating charts in various formats. | 202 |
| | An evaluation suite for assessing chart understanding in multimodal large language models. | 85 |
| | Tools and evaluation framework for accelerating the development of large multimodal models by providing an efficient way to assess their performance | 2,164 |
| | Develops a large-scale dataset and benchmark for training multimodal chart understanding models using large language models. | 87 |
| | An interactive charting and analysis tool for financial instruments | 66 |
| | Evaluates and compares the performance of multimodal large language models on various tasks | 56 |
| | Evaluates and benchmarks multimodal language models' ability to process visual, acoustic, and textual inputs simultaneously. | 15 |
| | An evaluation framework for multimodal language models' visual capabilities using image and question benchmarks. | 296 |
| | A benchmark for evaluating large language models' ability to process multimodal input | 322 |
| | A Python library providing NLP tools and utilities built on top of spaCy for text processing and analysis. | 2,217 |
| | A benchmarking framework for evaluating Large Multimodal Models by providing rigorous metrics and an efficient evaluation pipeline. | 22 |
| | A package providing tools and metrics for evaluating Earth system models | 104 |
| | A QML-based implementation of charting components using Chart.js-like functionality | 140 |
| | Evaluates the capabilities of large multimodal models using a set of diverse tasks and metrics | 274 |
| | An evaluation toolkit for large vision-language models | 1,514 |