gill

Image generator

A software framework for generating images and text using large language models

🐟 Code and models for the NeurIPS 2023 paper "Generating Images with Multimodal Language Models".

GitHub

440 stars

15 watching

38 forks

Language: Jupyter Notebook

last commit: over 1 year ago

computer-visionlarge-language-modelsmachine-learningnatural-language-processing

jykoh.com/gill

Related projects:

Repository	Description	Stars
kohjingyu/fromage	A framework for grounding language models to images and handling multimodal inputs and outputs	478
mingyuliutw/cogan	An implementation of a Generative Adversarial Network (GAN) designed to generate diverse types of images from single input images	286
mansimov/text2image	A model that generates image patches from natural language descriptions by iteratively drawing and attending to relevant words.	594
google-research/parti	An autoregressive text-to-image generation model that generates photorealistic images from text prompts and leverages advances in large language models.	1,554
ibm/max-fast-neural-style-transfer	A service for generating new images by mixing the content of an input image with the style of another image.	51
pixray/pixray	An image generation system built around CLIP and GAN techniques.	1,030
zsdonghao/text-to-image	A TensorFlow implementation of generating images from text descriptions using a Generative Adversarial Network (GAN) architecture	602
google-research/xmcgan_image_generation	This implementation enables text-to-image generation by leveraging cross-modal contrastive learning.	98
nousr/koi	An AI-powered plugin for Krita that enables img2img generation using Stable Diffusion models	445
mingyuliutw/unit	An unsupervised deep learning framework for translating images between different modalities	1,994
ibm/max-image-caption-generator	An image caption generation system utilizing machine learning models and deep neural networks.	84
google/sg2im	An end-to-end neural network model that generates images from scene graphs by processing input graph information through multiple layers of networks	1,302
gligen/gligen	A system that enables new capabilities in frozen text-to-image generation models to ground on various prompts, including boxes, keypoints, and images.	2,036
taoxugit/attngan	Reproduces text-to-image generation with attentional generative adversarial networks.	1,343
yuweihao/kern	An open-source implementation of a graph neural network architecture for scene graph generation in computer vision	121