unleashing-transformers

Image generator

A method for generating high-resolution images by combining vector quantization with transformer models.

Code for the ECCV 2022 paper "Unleashing Transformers"

GitHub

182 stars
7 watching
23 forks
Language: Python
Last commit: over 1 year ago
Linked from 1 awesome list


Related projects:

Repository Description Stars
thudm/cogview A framework for generating images from text using transformers. 1,735
pixart-alpha/pixart-sigma A PyTorch model for 4K text-to-image generation using a diffusion transformer. 1,711
ibm/max-fast-neural-style-transfer A service for generating new images by mixing the content of an input image with the style of another image. 51
pixboost/transformimgs An image transformation API that resizes and optimises images based on user input without requiring additional parameters. 235
carpedm20/began-pytorch A PyTorch implementation of BEGAN (Boundary Equilibrium Generative Adversarial Networks) for image generation. 262
soumith/dcgan.torch A Torch implementation of DCGAN, a deep generative model that learns to generate images from a dataset. 1,466
ypxie/hdgan A PyTorch implementation of an adversarial network for generating high-definition images from text descriptions. 150
taoxugit/attngan Reproduces text-to-image generation with attentional generative adversarial networks. 1,343
eps696/aphantasia A text-to-image tool using CLIP and FFT/DWT parameters to generate detailed images from user-provided text prompts. 778
woozzu/dong_iccv_2017 A PyTorch implementation of semantic image synthesis via adversarial learning. 145
leverxgroup/esrgan An enhanced deep learning framework for image super-resolution tasks using Generative Adversarial Networks and PyTorch. 151
akanimax/pro_gan_pytorch Implementation of a deep learning model for generating high-quality images with improved stability and variation. 538
mansimov/text2image A model that generates image patches from natural language descriptions by iteratively drawing and attending to relevant words. 594
lts4/universal Tools and libraries to generate perturbations for attacking image classification models. 242
ika-rwth-aachen/cam2bev Transforms images from multiple cameras into semantically segmented bird's-eye-view images using deep learning. 720