StoryDiffusion
Narrative generator
This project provides an implementation of a self-attention mechanism for long-range image and video generation, enabling the creation of consistent narratives.
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
6k stars
86 watching
597 forks
Language: Jupyter Notebook
last commit: about 2 months ago Related projects:
Repository | Description | Stars |
---|---|---|
cumulo-autumn/streamdiffusion | A pipeline-level solution for real-time interactive image generation using diffusion-based techniques | 9,736 |
doubiiu/dynamicrafter | This project generates animated videos from open-domain images by leveraging pre-trained video diffusion priors. | 2,580 |
nateraw/stable-diffusion-videos | Generates videos by manipulating the latent space of Stable Diffusion to create morphing sequences from text prompts. | 4,453 |
ashawkey/stable-dreamfusion | Generates 3D content from text using a combination of neural networks and image synthesis. | 8,296 |
stability-ai/stablediffusion | A software project that enables high-resolution image synthesis through a specific type of generative model using latent diffusion processes. | 39,192 |
doubiiu/tooncrafter | Generates cartoon-style videos from two images using pre-trained diffusion models | 5,353 |
nvlabs/instant-ngp | A software toolkit for training and rendering neural graphics primitives | 16,033 |
thudm/cogvideo | Generates videos from text and images using large language models | 9,156 |
lkwq007/stablediffusion-infinity | An application that uses Stable Diffusion to generate outpainting results on an infinite canvas | 3,850 |
zejun-yang/aniportrait | An open-source framework for generating photorealistic animations driven by audio and reference images. | 4,642 |
junyanz/igan | Interactive image generation using Generative Adversarial Networks to satisfy user edits in real-time | 3,977 |
jina-ai/dalle-flow | An interactive workflow for generating high-definition images from text prompts using a human-in-the-loop approach | 2,834 |
mkocabas/vibe | A video pose and shape estimation method that predicts body parameters for each frame of an input video. | 2,897 |
sygil-dev/sygil-webui | A web-based UI for Stable Diffusion image generation | 7,875 |
pkuliyi2015/multidiffusion-upscaler-for-automatic1111 | A Python-based image generation tool utilizing advanced diffusion techniques to upscale or generate large images with limited VRAM resources. | 4,768 |