gill
Image generator
A software framework for generating images and text using large language models
🐟 Code and models for the NeurIPS 2023 paper "Generating Images with Multimodal Language Models".
440 stars
15 watching
38 forks
Language: Jupyter Notebook
last commit: 12 months ago computer-visionlarge-language-modelsmachine-learningnatural-language-processing
Related projects:
Repository | Description | Stars |
---|---|---|
kohjingyu/fromage | A framework for grounding language models to images and handling multimodal inputs and outputs | 478 |
mingyuliutw/cogan | An implementation of a Generative Adversarial Network (GAN) designed to generate diverse types of images from single input images | 286 |
mansimov/text2image | A model that generates image patches from natural language descriptions by iteratively drawing and attending to relevant words. | 594 |
google-research/parti | An autoregressive text-to-image generation model that generates photorealistic images from text prompts and leverages advances in large language models. | 1,554 |
ibm/max-fast-neural-style-transfer | A service for generating new images by mixing the content of an input image with the style of another image. | 51 |
pixray/pixray | An image generation system built around CLIP and GAN techniques. | 1,030 |
zsdonghao/text-to-image | A TensorFlow implementation of generating images from text descriptions using a Generative Adversarial Network (GAN) architecture | 602 |
google-research/xmcgan_image_generation | This implementation enables text-to-image generation by leveraging cross-modal contrastive learning. | 98 |
nousr/koi | An AI-powered plugin for Krita that enables img2img generation using Stable Diffusion models | 445 |
mingyuliutw/unit | An unsupervised deep learning framework for translating images between different modalities | 1,994 |
ibm/max-image-caption-generator | An image caption generation system utilizing machine learning models and deep neural networks. | 84 |
google/sg2im | An end-to-end neural network model that generates images from scene graphs by processing input graph information through multiple layers of networks | 1,302 |
gligen/gligen | A system that enables new capabilities in frozen text-to-image generation models to ground on various prompts, including boxes, keypoints, and images. | 2,036 |
taoxugit/attngan | Reproduces text-to-image generation with attentional generative adversarial networks. | 1,343 |
yuweihao/kern | An open-source implementation of a graph neural network architecture for scene graph generation in computer vision | 121 |