gill

Image generator

A software framework for generating images and text using large language models

🐟 Code and models for the NeurIPS 2023 paper "Generating Images with Multimodal Language Models".

GitHub

430 stars
15 watching
36 forks
Language: Jupyter Notebook
last commit: 10 months ago
computer-visionlarge-language-modelsmachine-learningnatural-language-processing

Related projects:

Repository Description Stars
kohjingyu/fromage A framework for grounding language models to images and handling multimodal inputs and outputs 478
mingyuliutw/cogan An implementation of a Generative Adversarial Network (GAN) designed to generate diverse types of images from single input images 285
mansimov/text2image A model that generates image patches from natural language descriptions by iteratively drawing and attending to relevant words. 592
google-research/parti An autoregressive text-to-image generation model that generates photorealistic images from text prompts and leverages advances in large language models. 1,548
ibm/max-fast-neural-style-transfer A service for generating new images by mixing the content of an input image with the style of another image. 50
pixray/pixray An image generation system built around CLIP and GAN techniques. 1,027
zsdonghao/text-to-image A TensorFlow implementation of generating images from text descriptions using a Generative Adversarial Network (GAN) architecture 599
google-research/xmcgan_image_generation This implementation enables text-to-image generation by leveraging cross-modal contrastive learning. 98
nousr/koi An AI-powered plugin for Krita that enables img2img generation using Stable Diffusion models 445
mingyuliutw/unit An unsupervised deep learning framework for translating images between different modalities 1,988
ibm/max-image-caption-generator An image caption generation system utilizing machine learning models and deep neural networks. 84
google/sg2im An end-to-end neural network model that generates images from scene graphs by processing input graph information through multiple layers of networks 1,300
gligen/gligen A system that enables new capabilities in frozen text-to-image generation models to ground on various prompts, including boxes, keypoints, and images. 2,016
taoxugit/attngan Reproduces text-to-image generation with attentional generative adversarial networks. 1,339
yuweihao/kern An open-source implementation of a graph neural network architecture for scene graph generation in computer vision 120