TimeChat

Video understanding model

A large language model designed to understand long videos by binding visual content with timestamps and producing video token sequences of varying lengths.

[CVPR 2024] TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding

GitHub

314 stars
5 watching
26 forks
Language: Python
last commit: about 2 months ago

Related projects:

Repository Description Stars
rese1f/moviechat Develops a method for long video understanding by optimizing memory usage 550
pku-yuangroup/video-bench Evaluates and benchmarks large language models' video understanding capabilities 121
mbzuai-oryx/video-chatgpt A video conversation model that generates meaningful conversations about videos using large vision and language models 1,246
yuangongnd/ltu An audio and speech large language model implementation with pre-trained models, datasets, and inference options 396
llyx97/tempcompass A tool to evaluate video language models' ability to understand and describe video content 91
liuzhao1225/youdub-webui A web-based video processing tool that uses AI to facilitate cultural and linguistic tasks such as transcription, translation, and audio synthesis. 1,980
shangwei5/vidue A deep learning model that jointly performs video frame interpolation and deblurring with unknown exposure time 69
boheumd/ma-lmm This project develops an AI model for long-term video understanding 254
kendryte/toucan-llm A large language model with 70 billion parameters designed for chatbot and conversational AI tasks 29
thu-coai/opd A large-scale pre-trained dialogue model for Chinese language 74
lightyear-turing/turingmm-34b-chat An English-Chinese chat model developed from a large language model for conversational AI 9
pku-yuangroup/chronomagic-bench Provides a benchmarking framework for evaluating the quality of text-to-video generation models 191
yunwentechnology/unilm This project provides pre-trained models and tools for natural language understanding (NLU) and generation (NLG) tasks in Chinese. 439
brightmart/xlnet_zh Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks 230
andrewnguonly/chatabstractions Provides a framework for creating custom chat models with dynamic failover and load balancing features 79