Kolors

Text-to-Image Synthesis Framework

A Python framework for training and deploying photorealistic text-to-image synthesis models.

Kolors Team

GitHub

4k stars
39 watching
268 forks
Language: Python
last commit: 11 days ago

Related projects:

Repository Description Stars
lucidrains/dalle2-pytorch An implementation of DALL-E 2's text-to-image synthesis neural network in PyTorch 11,148
cszn/kair Image restoration toolbox with training and testing codes for various deep learning-based methods 2,968
ai-forever/kandinsky-2 A multilingual text2image latent diffusion model with improved aesthetics and controllability 2,769
transcranial/keras-js An open source JavaScript library that allows running Keras models in the browser with GPU support using WebGL. 4,963
huggingface/diffusers A PyTorch-based library for training and using state-of-the-art diffusion models to generate images, audio, and 3D structures 26,223
clovaai/stargan-v2 A Python implementation of an image-to-image translation model for generating diverse images across multiple domains. 3,506
doubiiu/dynamicrafter This project generates animated videos from open-domain images by leveraging pre-trained video diffusion priors. 2,596
leoxiaobin/deep-high-resolution-net.pytorch An implementation of a deep learning network for human pose estimation using high-resolution representations 4,327
lllyasviel/controlnet An implementation of a neural network structure to control diffusion models by adding extra conditions. 30,399
openai/guided-diffusion This project is a software implementation of a diffusion model architecture, allowing users to generate synthetic images based on a learned distribution. 6,293
modelscope/diffsynth-studio A software framework for training and utilizing various types of diffusion models. 6,599
tencent/hunyuandit A PyTorch-based diffusion transformer model for generating images with fine-grained Chinese understanding and text-to-image synthesis 3,456
yangxy/gpen An open-source software project that develops and provides tools for restoring degraded faces in images using Generative Pre-trained Networks. 2,441
borisdayma/dalle-mini Generates images from text prompts using a variant of the DALL-E model 14,751
dome272/wuerstchen A framework that enables efficient training of text-to-image models by compressing the computationally expensive stage into a latent space 528