gill
Image generator
A software framework for generating images and text using large language models
🐟 Code and models for the NeurIPS 2023 paper "Generating Images with Multimodal Language Models".
440 stars
15 watching
38 forks
Language: Jupyter Notebook
last commit: almost 2 years ago computer-visionlarge-language-modelsmachine-learningnatural-language-processing
Related projects:
Repository | Description | Stars |
---|---|---|
| A framework for grounding language models to images and handling multimodal inputs and outputs | 478 |
| An implementation of a Generative Adversarial Network (GAN) designed to generate diverse types of images from single input images | 286 |
| A model that generates image patches from natural language descriptions by iteratively drawing and attending to relevant words. | 594 |
| An autoregressive text-to-image generation model that generates photorealistic images from text prompts and leverages advances in large language models. | 1,554 |
| A service for generating new images by mixing the content of an input image with the style of another image. | 51 |
| An image generation system built around CLIP and GAN techniques. | 1,030 |
| A TensorFlow implementation of generating images from text descriptions using a Generative Adversarial Network (GAN) architecture | 602 |
| This implementation enables text-to-image generation by leveraging cross-modal contrastive learning. | 98 |
| An AI-powered plugin for Krita that enables img2img generation using Stable Diffusion models | 445 |
| An unsupervised deep learning framework for translating images between different modalities | 1,994 |
| An image caption generation system utilizing machine learning models and deep neural networks. | 84 |
| An end-to-end neural network model that generates images from scene graphs by processing input graph information through multiple layers of networks | 1,302 |
| A system that enables new capabilities in frozen text-to-image generation models to ground on various prompts, including boxes, keypoints, and images. | 2,036 |
| Reproduces text-to-image generation with attentional generative adversarial networks. | 1,343 |
| An open-source implementation of a graph neural network architecture for scene graph generation in computer vision | 121 |