w2vv
Text-to-image predictor
A deep neural network architecture that predicts visual features from text to improve image and video caption retrieval
Word2VisualVec : Predicting Visual Features from Text for Image and Video Caption Retrieval
69 stars
4 watching
13 forks
Language: Python
last commit: almost 5 years ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
danieljf24/dual_encoding | A deep learning project that provides a video-text retrieval model and tools for training and evaluating it on the MSR-VTT dataset | 155 |
imatge-upc/saliency-2016-cvpr | This project proposes a solution to predict salient areas in images using convolutional neural networks. | 185 |
canjie-luo/moran_v2 | A deep learning framework for scene text recognition with rectification and attention mechanisms. | 636 |
abdulk084/smiles2vec | An implementation of a deep learning model for predicting chemical properties from molecular structure data | 30 |
li-xirong/w2vvpp | A deep learning-based video search system using pre-trained models and datasets | 28 |
danieljf24/hybrid_space | Develops a deep learning framework for video retrieval using text and computer vision | 87 |
pxiangwu/motionnet | Develops accurate perception and motion prediction models for autonomous vehicles based on 2D maps | 169 |
mrgemy95/visual-interaction-networks-pytorch | An implementation of Deepmind's Visual Interaction Networks using PyTorch to predict future events in physical scenes. | 166 |
jcjohnson/cnn-vis | This project enables users to generate images using convolutional neural networks (CNNs) and visualize their activations. | 496 |
bjoux2/deepdtis_dbn | A Python framework for deep learning-based drug-target interaction prediction using a DBN architecture. | 48 |
junyanz/realismcnn | Predicts and improves visual realism in composite images using deep learning techniques | 64 |
deeprnn/image_captioning | This implementation allows users to generate captions from images using a neural network model with visual attention. | 786 |
gist-csbl/deepconv-dti | A Python script for training and testing deep learning models to predict drug-target interactions | 71 |
dougsm/ggcnn | A deep learning-based system for generating precise grasp poses for robots in real-time | 514 |
jcjohnson/densecap | A deep learning framework for generating natural language descriptions of images by detecting objects and their attributes | 1,584 |