StoryDiffusion

Narrative generator

This project provides an implementation of a self-attention mechanism for long-range image and video generation, enabling the creation of consistent narratives.

Accepted as [NeurIPS 2024] Spotlight Presentation Paper

GitHub

6k stars
86 watching
597 forks
Language: Jupyter Notebook
last commit: about 2 months ago

Related projects:

Repository Description Stars
cumulo-autumn/streamdiffusion A pipeline-level solution for real-time interactive image generation using diffusion-based techniques 9,736
doubiiu/dynamicrafter This project generates animated videos from open-domain images by leveraging pre-trained video diffusion priors. 2,580
nateraw/stable-diffusion-videos Generates videos by manipulating the latent space of Stable Diffusion to create morphing sequences from text prompts. 4,453
ashawkey/stable-dreamfusion Generates 3D content from text using a combination of neural networks and image synthesis. 8,296
stability-ai/stablediffusion A software project that enables high-resolution image synthesis through a specific type of generative model using latent diffusion processes. 39,192
doubiiu/tooncrafter Generates cartoon-style videos from two images using pre-trained diffusion models 5,353
nvlabs/instant-ngp A software toolkit for training and rendering neural graphics primitives 16,033
thudm/cogvideo Generates videos from text and images using large language models 9,156
lkwq007/stablediffusion-infinity An application that uses Stable Diffusion to generate outpainting results on an infinite canvas 3,850
zejun-yang/aniportrait An open-source framework for generating photorealistic animations driven by audio and reference images. 4,642
junyanz/igan Interactive image generation using Generative Adversarial Networks to satisfy user edits in real-time 3,977
jina-ai/dalle-flow An interactive workflow for generating high-definition images from text prompts using a human-in-the-loop approach 2,834
mkocabas/vibe A video pose and shape estimation method that predicts body parameters for each frame of an input video. 2,897
sygil-dev/sygil-webui A web-based UI for Stable Diffusion image generation 7,875
pkuliyi2015/multidiffusion-upscaler-for-automatic1111 A Python-based image generation tool utilizing advanced diffusion techniques to upscale or generate large images with limited VRAM resources. 4,768