MagicTime
Video generator
Generates time-lapse videos from text inputs using deep learning models.
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
1k stars
21 watching
125 forks
Language: Python
last commit: 13 days ago diffusion-modelslong-video-generationmetamorphic-video-generationopen-sora-plantext-to-videotime-lapsetime-lapse-datasetvideo-generation
Related projects:
Repository | Description | Stars |
---|---|---|
pku-yuangroup/chronomagic-bench | Provides a benchmarking framework for evaluating the quality of text-to-video generation models | 191 |
showlab/show-1 | This project enables text-to-video generation using a combination of pixel and latent diffusion models. | 1,110 |
pku-yuangroup/open-sora-dataset | A large video dataset collected from various open-source websites for use in computer vision and multimedia applications. | 94 |
singularity42/vgan-tensorflow | An implementation of a deep learning model to generate videos with dynamic scenes | 15 |
pku-yuangroup/video-bench | Evaluates and benchmarks large language models' video understanding capabilities | 121 |
researchmm/sttn | Proposes a deep learning model to fill missing regions in video frames and generate completed videos | 480 |
damo-nlp-sg/videollama2 | An audio-visual language model designed to advance spatial-temporal modeling and audio understanding in video processing. | 957 |
eps696/aphantasia | A text-to-image tool using CLIP and FFT/DWT parameters to generate detailed images from user-provided text prompts. | 778 |
tmetsch/pytkgen | A Python module that allows defining GUIs in JSON files and generating Tkinter widgets from them | 120 |
yuweihao/kern | An open-source implementation of a graph neural network architecture for scene graph generation in computer vision | 121 |
showlab/vlog | Transforms video content into a long document containing visual and audio information that can be used for chat or other applications. | 546 |
renshuhuai-andy/timechat | A large language model designed to understand long videos by binding visual content with timestamps and producing video token sequences of varying lengths. | 314 |
mbzuai-oryx/video-chatgpt | A video conversation model that generates meaningful conversations about videos using large vision and language models | 1,246 |
zsdonghao/text-to-image | A TensorFlow implementation of generating images from text descriptions using a Generative Adversarial Network (GAN) architecture | 602 |
kronoscode/django-magicembed | Provides a tool to easily embed videos and generate thumbnails in Django web applications. | 19 |