attend_infer_repeat

Scene understanding model

An implementation of Attend, Infer, Repeat, a method for fast scene understanding using generative models.

A Tensorfflow implementation of Attend, Infer, Repeat

GitHub

82 stars
4 watching
20 forks
Language: Python
last commit: almost 6 years ago
Linked from 1 awesome list

attend-infer-repeatattentionattention-mechanismcomputer-graphicscomputer-visiongenerative-modelneural-networksrnntensorflowvae

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
renmengye/rec-attend-public An implementation of end-to-end instance segmentation using recurrent attention 109
yunjey/show-attend-and-tell Generates captions for images using an attention-based neural network 907
visionlearninggroup/ask_attend_and_answer Develops a deep learning model to answer questions about visual scenes based on spatial attention and question guidance 25
emedvedev/attention-ocr A TensorFlow model for recognizing text in images using visual attention and a sequence-to-sequence architecture. 1,077
ucas-haoranwei/vary An implementation of a vision vocabulary model for large language models to improve document understanding and recognition capabilities 1,817
javeywang/pyramid-attention-networks-pytorch An implementation of a deep learning model using PyTorch for semantic segmentation tasks. 235
jazzsaxmafia/show_attend_and_tell.tensorflow A TensorFlow implementation of a neural caption generator using attention mechanisms. 506
kaushalshetty/structured-self-attention A deep learning model that generates sentence embeddings using structured self-attention and is used for binary and multiclass classification tasks. 494
tianyi-lab/hallusionbench An image-context reasoning benchmark designed to challenge large vision-language models and help improve their accuracy 243
pbiecek/xaiaterum2020 An R package and workshop materials for explaining machine learning models using explainable AI techniques 52
sicara/tf-explain A library providing interpretability methods for TensorFlow 2.x models 1,018
interpretml/dice Provides counterfactual explanations for machine learning models to facilitate interpretability and understanding. 1,364
robsyme/nf-repeatmasking Automates repeat detection and masking in genomic data analysis 13
isekai-portal/link-context-learning An implementation of a multimodal learning approach to improve language models' ability to recognize unseen images and understand novel concepts. 89
sarababakn/mfcl-neurips23 A framework for mitigating catastrophic forgetting in federated learning for vision tasks using data synthesis from past distributions. 15