w2vv

Text-to-visual-feature predictor

A deep neural network architecture that predicts visual features from text to improve image and video caption retrieval

Word2VisualVec: Predicting Visual Features from Text for Image and Video Caption Retrieval
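
As a rough illustration of the idea in the description above (predict a visual feature from a sentence, then rank candidate captions against an image's feature), here is a minimal pure-Python sketch. It is not the repository's code: the toy word vectors in WORD_VECS, the projection matrix W, the bag-of-words averaging, and all function names are hypothetical stand-ins for whatever learned text encoder and projection the actual model uses.

```python
import math

# Hypothetical toy word vectors (2-D); a real system would use learned embeddings.
WORD_VECS = {
    "dog": [1.0, 0.0],
    "runs": [0.8, 0.2],
    "cat": [0.0, 1.0],
    "sleeps": [0.2, 0.8],
}

# Hypothetical projection from the 2-D text space to a 3-D "visual" space.
# In a trained model these weights would be learned, not hand-written.
W = [
    [1.0, 0.0],
    [0.0, 1.0],
    [0.5, 0.5],
]


def encode_text(sentence):
    """Average the vectors of known words (a simple bag-of-words encoding)."""
    vecs = [WORD_VECS[w] for w in sentence.lower().split() if w in WORD_VECS]
    if not vecs:
        return [0.0, 0.0]
    return [sum(col) / len(vecs) for col in zip(*vecs)]


def predict_visual(sentence):
    """Project the text encoding into the visual feature space."""
    t = encode_text(sentence)
    return [sum(w * x for w, x in zip(row, t)) for row in W]


def cosine(a, b):
    """Cosine similarity, returning 0.0 if either vector is all zeros."""
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    if na == 0.0 or nb == 0.0:
        return 0.0
    return sum(x * y for x, y in zip(a, b)) / (na * nb)


def rank_captions(image_feature, captions):
    """Sort captions by similarity between their predicted visual feature and the image feature."""
    return sorted(
        captions,
        key=lambda c: cosine(predict_visual(c), image_feature),
        reverse=True,
    )


if __name__ == "__main__":
    image_feature = [0.9, 0.1, 0.5]  # made-up feature for a "dog" image
    print(rank_captions(image_feature, ["cat sleeps", "dog runs"]))
```

The matching happens in the visual feature space rather than in a shared joint space, which is the point the description makes; everything else in the sketch is a simplification.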

GitHub

69 stars

4 watching

13 forks

Language: Python

Last commit: over 6 years ago

Linked from 1 awesome list

Backlinks from these awesome lists:

danieljf24/awesome-video-text-retrieval

Related projects:

Repository	Description	Stars
danieljf24/dual_encoding	A deep learning project that provides a video-text retrieval model and tools for training and evaluating it on the MSR-VTT dataset	154
imatge-upc/saliency-2016-cvpr	Predicts salient areas in images using convolutional neural networks	186
canjie-luo/moran_v2	A deep learning framework for scene text recognition with rectification and attention mechanisms.	639
abdulk084/smiles2vec	An implementation of a deep learning model for predicting chemical properties from molecular structure data	30
li-xirong/w2vvpp	A deep learning-based video search system using pre-trained models and datasets	28
danieljf24/hybrid_space	Develops a deep learning framework for video retrieval using text and computer vision	87
pxiangwu/motionnet	Predicts motion and perceives scenes from bird's-eye-view maps for autonomous driving	169
mrgemy95/visual-interaction-networks-pytorch	A PyTorch implementation of DeepMind's Visual Interaction Networks for predicting future events in physical scenes	166
jcjohnson/cnn-vis	Generates images using convolutional neural networks (CNNs) and visualizes their activations	499
bjoux2/deepdtis_dbn	A Python framework for deep learning-based drug-target interaction prediction using a DBN architecture.	49
junyanz/realismcnn	Predicts and improves visual realism in composite images using deep learning techniques	64
deeprnn/image_captioning	Generates captions from images using a neural network model with visual attention	790
gist-csbl/deepconv-dti	A Python script for training and testing deep learning models to predict drug-target interactions	73
dougsm/ggcnn	A deep learning-based system for generating precise grasp poses for robots in real-time	525
jcjohnson/densecap	A deep learning framework for generating natural language descriptions of images by detecting objects and their attributes	1,584