Kolors
Text-to-Image Synthesis Framework
A Python framework for training and deploying photorealistic text-to-image synthesis models.
Kolors Team
4k stars
39 watching
268 forks
Language: Python
last commit: 11 days ago Related projects:
Repository | Description | Stars |
---|---|---|
lucidrains/dalle2-pytorch | An implementation of DALL-E 2's text-to-image synthesis neural network in PyTorch | 11,148 |
cszn/kair | Image restoration toolbox with training and testing codes for various deep learning-based methods | 2,968 |
ai-forever/kandinsky-2 | A multilingual text2image latent diffusion model with improved aesthetics and controllability | 2,769 |
transcranial/keras-js | An open source JavaScript library that allows running Keras models in the browser with GPU support using WebGL. | 4,963 |
huggingface/diffusers | A PyTorch-based library for training and using state-of-the-art diffusion models to generate images, audio, and 3D structures | 26,223 |
clovaai/stargan-v2 | A Python implementation of an image-to-image translation model for generating diverse images across multiple domains. | 3,506 |
doubiiu/dynamicrafter | This project generates animated videos from open-domain images by leveraging pre-trained video diffusion priors. | 2,596 |
leoxiaobin/deep-high-resolution-net.pytorch | An implementation of a deep learning network for human pose estimation using high-resolution representations | 4,327 |
lllyasviel/controlnet | An implementation of a neural network structure to control diffusion models by adding extra conditions. | 30,399 |
openai/guided-diffusion | This project is a software implementation of a diffusion model architecture, allowing users to generate synthetic images based on a learned distribution. | 6,293 |
modelscope/diffsynth-studio | A software framework for training and utilizing various types of diffusion models. | 6,599 |
tencent/hunyuandit | A PyTorch-based diffusion transformer model for generating images with fine-grained Chinese understanding and text-to-image synthesis | 3,456 |
yangxy/gpen | An open-source software project that develops and provides tools for restoring degraded faces in images using Generative Pre-trained Networks. | 2,441 |
borisdayma/dalle-mini | Generates images from text prompts using a variant of the DALL-E model | 14,751 |
dome272/wuerstchen | A framework that enables efficient training of text-to-image models by compressing the computationally expensive stage into a latent space | 528 |