TimeChat
Video understanding model
A large language model designed to understand and process long videos with temporal information
[CVPR 2024] TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding
286 stars
5 watching
25 forks
Language: Python
last commit: 6 months ago

Related projects:

Repository | Description | Stars |
---|---|---|
rese1f/moviechat | A deep learning model designed to efficiently process and analyze long videos using large language models | 525 |
pku-yuangroup/video-bench | Evaluates and benchmarks large language models' video understanding capabilities | 117 |
mbzuai-oryx/video-chatgpt | A video conversation model that generates meaningful conversations about videos using large vision and language models | 1,213 |
yuangongnd/ltu | An audio and speech large language model implementation with pre-trained models, datasets, and inference options | 385 |
llyx97/tempcompass | A tool to evaluate video language models' ability to understand and describe video content | 84 |
liuzhao1225/youdub-webui | A web-based video processing tool that uses AI to facilitate cultural and linguistic tasks such as transcription, translation, and audio synthesis | 1,940 |
shangwei5/vidue | A deep learning model that jointly performs video frame interpolation and deblurring with unknown exposure time | 66 |
boheumd/ma-lmm | This project develops an AI model for long-term video understanding | 244 |
kendryte/toucan-llm | A large language model with 70 billion parameters designed for chatbot and conversational AI tasks | 29 |
thu-coai/opd | A large-scale pre-trained dialogue model for Chinese language | 74 |
lightyear-turing/turingmm-34b-chat | An English-Chinese chat model developed from a large language model for conversational AI | 9 |
pku-yuangroup/chronomagic-bench | A benchmark and dataset for evaluating text-to-video generation models' ability to generate coherent and varied metamorphic time-lapse videos | 186 |
yunwentechnology/unilm | This project provides pre-trained models for natural language understanding and generation tasks using the UniLM architecture | 438 |
brightmart/xlnet_zh | Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks | 230 |
andrewnguonly/chatabstractions | Provides a framework for creating custom chat models with dynamic failover and load balancing features | 79 |