Video-Bench

Video benchmarking toolkit

Evaluates and benchmarks large language models' video understanding capabilities

A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language Models!

GitHub

121 stars

3 watching

2 forks

Language: Python

last commit: over 2 years ago

benchmarklarge-language-modelstoolkit

Related projects:

Repository	Description	Stars
pku-yuangroup/chronomagic-bench	Provides a benchmarking framework for evaluating the quality of text-to-video generation models	191
pku-yuangroup/languagebind	Extending pretraining models to handle multiple modalities by aligning language and video representations	751
pku-yuangroup/open-sora-dataset	A large video dataset collected from various open-source websites for use in computer vision and multimedia applications.	94
felixgithub2017/mmcu	Measures the understanding of massive multitask Chinese datasets using large language models	87
shawn-ieitsystems/yuan-1.0	Large-scale language model with improved performance on NLP tasks through distributed training and efficient data processing	591
renshuhuai-andy/timechat	A large language model designed to understand long videos by binding visual content with timestamps and producing video token sequences of varying lengths.	314
antoine77340/howto100m	Provides code and tools for learning joint text-video embeddings using the HowTo100M dataset	254
huaizhengzhang/awsome-deep-learning-for-video-analysis	A collection of resources and tools for video analysis using deep learning and multi-modal learning techniques.	767
tencent/tencent-hunyuan-large	This project makes a large language model accessible for research and development	1,245
bradyfu/video-mme	Comprehensive benchmark for evaluating multi-modal large language models on video analysis tasks	422
jshilong/gpt4roi	Training and deploying large language models on computer vision tasks using region-of-interest inputs	517
opengvlab/internvideo	Develops general video foundation models and related datasets for multimodal understanding and generation through generative and discriminative learning.	1,467
aliaksandrsiarohin/video-preprocessing	Tools for preprocessing videos for various datasets, including video cropping and annotation.	522
tianyi-lab/hallusionbench	An image-context reasoning benchmark designed to challenge large vision-language models and help improve their accuracy	259
qcri/llmebench	A benchmarking framework for large language models	81