TimeChat

Video understanding model

A time-sensitive multimodal large language model designed for long video understanding

[CVPR 2024] TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding

GitHub

286 stars
5 watching
25 forks
Language: Python
Last commit: 6 months ago

Related projects:

Repository | Description | Stars
rese1f/moviechat | A deep learning model designed to efficiently process and analyze long videos using large language models | 525
pku-yuangroup/video-bench | Evaluates and benchmarks large language models' video understanding capabilities | 117
mbzuai-oryx/video-chatgpt | A video conversation model that generates meaningful conversations about videos using large vision and language models | 1,213
yuangongnd/ltu | An audio and speech large language model implementation with pre-trained models, datasets, and inference options | 385
llyx97/tempcompass | A tool to evaluate video language models' ability to understand and describe video content | 84
liuzhao1225/youdub-webui | A web-based video processing tool that uses AI to facilitate cultural and linguistic tasks such as transcription, translation, and audio synthesis | 1,940
shangwei5/vidue | A deep learning model that jointly performs video frame interpolation and deblurring with unknown exposure time | 66
boheumd/ma-lmm | This project develops an AI model for long-term video understanding | 244
kendryte/toucan-llm | A large language model with 70 billion parameters designed for chatbot and conversational AI tasks | 29
thu-coai/opd | A large-scale pre-trained dialogue model for Chinese language | 74
lightyear-turing/turingmm-34b-chat | An English-Chinese chat model developed from a large language model for conversational AI | 9
pku-yuangroup/chronomagic-bench | A benchmark and dataset for evaluating text-to-video generation models' ability to generate coherent and varied metamorphic time-lapse videos | 187
yunwentechnology/unilm | This project provides pre-trained models for natural language understanding and generation tasks using the UniLM architecture | 438
brightmart/xlnet_zh | Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks | 230
andrewnguonly/chatabstractions | Provides a framework for creating custom chat models with dynamic failover and load balancing features | 79