CLIP

Image-text predictor

A neural network trained on image and text pairs to predict the most relevant text snippet given an image

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

GitHub

26k stars

325 watching

3k forks

Language: Jupyter Notebook

last commit: about 1 year ago

Linked from 1 awesome list

deep-learningmachine-learning

Backlinks from these awesome lists:

amrzv/awesome-colab-notebooks

Related projects:

Repository	Description	Stars
pharmapsychotic/clip-interrogator	A tool for optimizing text prompts to generate images with specific characteristics using CLIP and BLIP models.	2,725
lucidrains/dalle2-pytorch	An implementation of DALL-E 2's text-to-image synthesis neural network in PyTorch	11,184
jina-ai/clip-as-service	A service that provides fast and scalable image-text embeddings using CLIP models, supporting visual reasoning and integration with neural search ecosystems.	12,497
karpathy/neuraltalk2	Efficient image captioning model using a CNN followed by an RNN in deep learning on GPU	5,515
orpatashnik/styleclip	This project provides an implementation of a method to manipulate images by driving the style with text.	4,025
rom1504/clip-retrieval	A tool for efficiently computing and utilizing CLIP embeddings for semantic search in multimodal data	2,440
minimaxir/gpt-2-simple	A tool for retraining and fine-tuning the OpenAI GPT-2 text generation model on new datasets.	3,398
ai-forever/kandinsky-2	A multilingual text2image latent diffusion model with improved aesthetics and controllability	2,774
phillipi/pix2pix	An implementation of an image-to-image translation algorithm using deep learning techniques	10,208
openai/glide-text2im	A diffusion-based text-conditional image synthesis model	3,562
dmitryulyanov/deep-image-prior	A project demonstrating image restoration using neural networks without learning	7,920
osai-ai/tensor-stream	Decodes real-time video streams into CUDA memory to support GPU-based deep learning processing	380
zhixuhao/unet	An implementation of the U-Net architecture for image segmentation using Keras and TensorFlow.	4,623
donnyyou/torchcv	A comprehensive PyTorch-based framework for computer vision tasks	2,249
openai/gpt-2	A repository providing code and models for research into language modeling and multitask learning	22,644