CLIP
Image-text predictor
A neural network trained on image and text pairs to predict the most relevant text snippet given an image
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
26k stars
325 watching
3k forks
Language: Jupyter Notebook
last commit: 7 months ago
Linked from 1 awesome list
deep-learningmachine-learning
Related projects:
Repository | Description | Stars |
---|---|---|
| A tool for optimizing text prompts to generate images with specific characteristics using CLIP and BLIP models. | 2,725 |
| An implementation of DALL-E 2's text-to-image synthesis neural network in PyTorch | 11,184 |
| A service that provides fast and scalable image-text embeddings using CLIP models, supporting visual reasoning and integration with neural search ecosystems. | 12,497 |
| Efficient image captioning model using a CNN followed by an RNN in deep learning on GPU | 5,515 |
| This project provides an implementation of a method to manipulate images by driving the style with text. | 4,025 |
| A tool for efficiently computing and utilizing CLIP embeddings for semantic search in multimodal data | 2,440 |
| A tool for retraining and fine-tuning the OpenAI GPT-2 text generation model on new datasets. | 3,398 |
| A multilingual text2image latent diffusion model with improved aesthetics and controllability | 2,774 |
| An implementation of an image-to-image translation algorithm using deep learning techniques | 10,208 |
| A diffusion-based text-conditional image synthesis model | 3,562 |
| A project demonstrating image restoration using neural networks without learning | 7,920 |
| Decodes real-time video streams into CUDA memory to support GPU-based deep learning processing | 380 |
| An implementation of the U-Net architecture for image segmentation using Keras and TensorFlow. | 4,623 |
| A comprehensive PyTorch-based framework for computer vision tasks | 2,249 |
| A repository providing code and models for research into language modeling and multitask learning | 22,644 |