bottom-up-attention

Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome

GitHub

1k stars
26 watching
379 forks
Language: Jupyter Notebook
last commit: over 1 year ago
Linked from 1 awesome list

caffecaptioning-imagesfaster-rcnnimage-captioningmscocomscoco-datasetvisual-question-answeringvqa

Backlinks from these awesome lists: