Learning visually grounded word embeddings using Abstract scenes
GitHub
satwikkottur.github.io/VisualWord2Vec/