CLIP
Image-text predictor
A neural network trained on image and text pairs to predict the most relevant text snippet given an image
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
26k stars
325 watching
3k forks
Language: Jupyter Notebook
last commit: 4 months ago
Linked from 1 awesome list
deep-learningmachine-learning
Related projects:
Repository | Description | Stars |
---|---|---|
pharmapsychotic/clip-interrogator | A tool for optimizing text prompts to generate images with specific characteristics using CLIP and BLIP models. | 2,702 |
lucidrains/dalle2-pytorch | An implementation of DALL-E 2's text-to-image synthesis neural network in PyTorch | 11,148 |
jina-ai/clip-as-service | A service that provides fast and scalable image-text embeddings using CLIP models, supporting visual reasoning and integration with neural search ecosystems. | 12,455 |
karpathy/neuraltalk2 | Efficient image captioning model using a CNN followed by an RNN in deep learning on GPU | 5,511 |
orpatashnik/styleclip | This project provides an implementation of a method to manipulate images by driving the style with text. | 4,000 |
rom1504/clip-retrieval | A tool for efficiently computing and utilizing CLIP embeddings for semantic search in multimodal data | 2,411 |
minimaxir/gpt-2-simple | A tool for retraining and fine-tuning the OpenAI GPT-2 text generation model on new datasets. | 3,397 |
ai-forever/kandinsky-2 | A multilingual text2image latent diffusion model with improved aesthetics and controllability | 2,766 |
phillipi/pix2pix | An implementation of an image-to-image translation algorithm using deep learning techniques | 10,163 |
openai/glide-text2im | A diffusion-based text-conditional image synthesis model | 3,545 |
dmitryulyanov/deep-image-prior | A project demonstrating image restoration using neural networks without learning | 7,886 |
osai-ai/tensor-stream | Decodes real-time video streams into CUDA memory to support GPU-based deep learning processing | 380 |
zhixuhao/unet | An implementation of the U-Net architecture for image segmentation using Keras and TensorFlow. | 4,593 |
donnyyou/torchcv | A comprehensive PyTorch-based framework for computer vision tasks | 2,250 |
openai/gpt-2 | A repository providing code and models for research into language modeling and multitask learning | 22,516 |