Video-MME
Video analysis benchmark
Comprehensive benchmark for evaluating multi-modal large language models on video analysis tasks
✨✨Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
422 stars
5 watching
17 forks
last commit: 10 months ago
topics: large-language-models, large-vision-language-models, mme, multimodal-large-language-models, video, video-mme
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | An open-source benchmarking framework for evaluating cross-style visual capability of large multimodal models | 84 |
| | Evaluates and benchmarks large language models' video understanding capabilities | 121 |
| | Develops a method for long video understanding by optimizing memory usage | 550 |
| | An evaluation framework for multimodal language models' visual capabilities using image and question benchmarks. | 296 |
| | Measures the understanding of massive multitask Chinese datasets using large language models | 87 |
| | This project develops an AI model for long-term video understanding | 254 |
| | A multimodal large language model benchmark designed to simulate real-world challenges and measure the performance of such models in practical scenarios. | 86 |
| | Provides a benchmarking framework for evaluating the quality of text-to-video generation models | 191 |
| | A collection of resources and tools for video analysis using deep learning and multi-modal learning techniques. | 767 |
| | A benchmark for evaluating the performance of multimodal question answering models on diverse domains and data types | 46 |
| | A multimedia framework designed for video editing, providing tools and libraries for audio and video processing. | 1,522 |
| | A benchmark for evaluating large language models in multiple languages and formats | 93 |
| | Develops a cross-modal architecture for video retrieval by combining multiple types of features from videos and text | 259 |
| | A benchmarking framework for evaluating Large Multimodal Models by providing rigorous metrics and an efficient evaluation pipeline. | 22 |
| | Software to interpolate blurry video frames and enhance image quality | 209 |