IF

Image generator

A text-to-image synthesis model with a modular design, utilizing a frozen text encoder and cascaded pixel diffusion modules to generate photorealistic images.

GitHub

8k stars
84 watching
504 forks
Language: Python
last commit: 8 months ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
compvis/stable-diffusion A text-to-image model trained on images and text prompts using a diffusion process 68,750
lucidrains/dalle2-pytorch An implementation of DALL-E 2's text-to-image synthesis neural network in PyTorch 11,184
ai-forever/kandinsky-2 A multilingual text2image latent diffusion model with improved aesthetics and controllability 2,774
ashawkey/stable-dreamfusion Generates 3D content from text using a combination of neural networks and image synthesis. 8,351
doubiiu/dynamicrafter This project generates animated videos from open-domain images by leveraging pre-trained video diffusion priors. 2,668
stability-ai/stablediffusion A software project that enables high-resolution image synthesis through a specific type of generative model using latent diffusion processes. 39,501
modelscope/diffsynth-studio A software framework for training and utilizing various types of diffusion models. 6,641
openai/glide-text2im A diffusion-based text-conditional image synthesis model 3,562
nvidia/pix2pixhd Generates photorealistic images from conditional inputs using deep neural networks 6,685
microsoft/deepspeed A deep learning optimization library that simplifies distributed training and inference on modern computing hardware. 35,863
dmitryulyanov/deep-image-prior A project demonstrating image restoration using neural networks without learning 7,920
luodian/otter A multi-modal AI model developed for improved instruction-following and in-context learning, utilizing large-scale architectures and various training datasets. 3,570
xpixelgroup/diffbir This project provides a deep learning-based pipeline for restoring degraded images 3,445
doubiiu/tooncrafter Generates cartoon-style videos from two images using pre-trained diffusion models 5,447
openai/clip A neural network trained on image and text pairs to predict the most relevant text snippet given an image 26,460