IF
Image generator
A text-to-image synthesis model with a modular design, utilizing a frozen text encoder and cascaded pixel diffusion modules to generate photorealistic images.
8k stars
84 watching
499 forks
Language: Python
Last commit: 7 months ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
compvis/stable-diffusion | A text-to-image model trained on images and text prompts using a diffusion process | 68,368 |
lucidrains/dalle2-pytorch | An implementation of DALL-E 2's text-to-image synthesis neural network in PyTorch | 11,148 |
ai-forever/kandinsky-2 | A multilingual text2image latent diffusion model with improved aesthetics and controllability | 2,769 |
ashawkey/stable-dreamfusion | Generates 3D content from text prompts by pairing a NeRF with Stable Diffusion | 8,312 |
doubiiu/dynamicrafter | Animates open-domain still images into videos using pre-trained video diffusion priors | 2,596 |
stability-ai/stablediffusion | High-resolution image synthesis with latent diffusion models | 39,192 |
modelscope/diffsynth-studio | A framework for training and running diffusion models | 6,599 |
openai/glide-text2im | A diffusion-based text-conditional image synthesis model | 3,551 |
nvidia/pix2pixhd | Generates photorealistic images from conditional inputs using deep neural networks | 6,667 |
microsoft/deepspeed | A deep learning optimization library that makes distributed training and inference easy, efficient, and effective | 35,545 |
dmitryulyanov/deep-image-prior | A project demonstrating image restoration using an untrained neural network as the prior, with no training dataset | 7,886 |
luodian/otter | A multi-modal model tuned for improved instruction following and in-context learning | 3,563 |
xpixelgroup/diffbir | Generative model for blind image restoration and other related tasks using diffusion prior | 3,395 |
doubiiu/tooncrafter | Generates cartoon-style videos from two images using pre-trained diffusion models | 5,372 |
openai/clip | A neural network trained on image and text pairs to predict the most relevant text snippet given an image | 26,061 |