CLIP

Image-text predictor

A neural network trained on image and text pairs to predict the most relevant text snippet given an image

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

GitHub

26k stars
325 watching
3k forks
Language: Jupyter Notebook
last commit: 4 months ago
Linked from 1 awesome list

deep-learningmachine-learning

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
pharmapsychotic/clip-interrogator A tool for optimizing text prompts to generate images with specific characteristics using CLIP and BLIP models. 2,702
lucidrains/dalle2-pytorch An implementation of DALL-E 2's text-to-image synthesis neural network in PyTorch 11,148
jina-ai/clip-as-service A service that provides fast and scalable image-text embeddings using CLIP models, supporting visual reasoning and integration with neural search ecosystems. 12,455
karpathy/neuraltalk2 Efficient image captioning model using a CNN followed by an RNN in deep learning on GPU 5,511
orpatashnik/styleclip This project provides an implementation of a method to manipulate images by driving the style with text. 4,000
rom1504/clip-retrieval A tool for efficiently computing and utilizing CLIP embeddings for semantic search in multimodal data 2,411
minimaxir/gpt-2-simple A tool for retraining and fine-tuning the OpenAI GPT-2 text generation model on new datasets. 3,397
ai-forever/kandinsky-2 A multilingual text2image latent diffusion model with improved aesthetics and controllability 2,766
phillipi/pix2pix An implementation of an image-to-image translation algorithm using deep learning techniques 10,163
openai/glide-text2im A diffusion-based text-conditional image synthesis model 3,545
dmitryulyanov/deep-image-prior A project demonstrating image restoration using neural networks without learning 7,886
osai-ai/tensor-stream Decodes real-time video streams into CUDA memory to support GPU-based deep learning processing 380
zhixuhao/unet An implementation of the U-Net architecture for image segmentation using Keras and TensorFlow. 4,593
donnyyou/torchcv A comprehensive PyTorch-based framework for computer vision tasks 2,250
openai/gpt-2 A repository providing code and models for research into language modeling and multitask learning 22,516