bottom-up-attention
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
1k stars
26 watching
379 forks
Language: Jupyter Notebook
last commit: over 1 year ago
Linked from 1 awesome list
caffecaptioning-imagesfaster-rcnnimage-captioningmscocomscoco-datasetvisual-question-answeringvqa