IF

Image generator

A modular text-to-image synthesis model that uses a frozen text encoder and cascaded pixel diffusion modules to generate photorealistic images.
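
To make the "frozen text encoder plus cascaded modules" design concrete, here is a minimal toy sketch of the data flow. It is not the project's code: every function name, the tiny image sizes, the number of stages, and the averaging "denoising" loop are assumptions made for illustration, and no real model is involved.

```python
import hashlib
import random


def encode_text(prompt, dim=8):
    """Hypothetical stand-in for a frozen text encoder.

    It is a fixed, deterministic map from prompt to vector with no
    trainable state, which is the property "frozen" refers to.
    """
    digest = hashlib.sha256(prompt.encode("utf-8")).digest()
    return [b / 255.0 for b in digest[:dim]]


def base_stage(embedding, size=4, seed=0):
    """Hypothetical stand-in for the base pixel diffusion module.

    Starts from random noise and repeatedly pulls each pixel toward a
    value derived from the text embedding (a toy "denoising" loop).
    """
    rng = random.Random(seed)
    target = sum(embedding) / len(embedding)
    image = [[rng.random() for _ in range(size)] for _ in range(size)]
    for _ in range(10):
        image = [[0.5 * px + 0.5 * target for px in row] for row in image]
    return image


def upscale_stage(image, embedding, factor=2):
    """Hypothetical stand-in for a super-resolution module.

    Only does nearest-neighbour upsampling. A real module would also
    denoise while conditioning on the embedding, which is unused here.
    """
    out = []
    for row in image:
        wide = [px for px in row for _ in range(factor)]
        out.extend([list(wide) for _ in range(factor)])
    return out


def generate(prompt, upscale_factors=(2, 2)):
    """Run the cascade: encode once, generate small, then upscale in stages."""
    embedding = encode_text(prompt)  # computed once, passed to every stage
    image = base_stage(embedding)
    for factor in upscale_factors:
        image = upscale_stage(image, embedding, factor)
    return image


if __name__ == "__main__":
    img = generate("a red fox in the snow")
    print(len(img), len(img[0]))  # prints "16 16"
```

The point of the sketch is the structure: the text embedding is computed once and handed to each stage, and each stage works directly on pixels at a higher resolution than the one before it.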

GitHub

8k stars
84 watching
499 forks
Language: Python
Last commit: 7 months ago
Linked from 1 awesome list


Related projects:

Repository | Description | Stars
compvis/stable-diffusion | A text-to-image model trained on images and text prompts using a diffusion process | 68,368
lucidrains/dalle2-pytorch | An implementation of DALL-E 2's text-to-image synthesis neural network in PyTorch | 11,148
ai-forever/kandinsky-2 | A multilingual text2image latent diffusion model with improved aesthetics and controllability | 2,769
ashawkey/stable-dreamfusion | Generates 3D content from text using a combination of neural networks and image synthesis | 8,312
doubiiu/dynamicrafter | Generates animated videos from open-domain images by leveraging pre-trained video diffusion priors | 2,596
stability-ai/stablediffusion | High-resolution image synthesis using latent diffusion models | 39,192
modelscope/diffsynth-studio | A software framework for training and using various types of diffusion models | 6,599
openai/glide-text2im | A diffusion-based text-conditional image synthesis model | 3,551
nvidia/pix2pixhd | Generates photorealistic images from conditional inputs using deep neural networks | 6,667
microsoft/deepspeed | A deep learning optimization library that makes distributed training and inference easy, efficient, and effective | 35,545
dmitryulyanov/deep-image-prior | A project demonstrating image restoration using untrained neural networks | 7,886
luodian/otter | A multi-modal AI model developed for improved instruction-following and in-context learning, utilizing large-scale architectures and various training datasets | 3,563
xpixelgroup/diffbir | Generative model for blind image restoration and other related tasks using diffusion prior | 3,395
doubiiu/tooncrafter | Generates cartoon-style videos from two images using pre-trained diffusion models | 5,372
openai/clip | A neural network trained on image and text pairs to predict the most relevant text snippet given an image | 26,061