MORAN_v2

Scene Text Recognizer

A deep learning framework for scene text recognition with rectification and attention mechanisms.

MORAN: A Multi-Object Rectified Attention Network for Scene Text Recognition

GitHub

636 stars
24 watching
152 forks
Language: Python
Last commit: 4 months ago
Topics: attention-mechanism, image-deformation, image-rectification, scene-text, scene-text-recognition

Related projects:

Repository | Description | Stars
andrewhou1/clps1520project | An implementation of a recurrent convolutional neural network for scene labeling | 5
csailvision/places365 | Provides pre-trained deep learning models for scene classification on the Places365 dataset | 1,926
sergioburdisso/pyss3 | A Python package implementing an interpretable machine learning model for text classification with visualization tools | 336
tangzhenyu/scene-text-understanding | A research project focused on developing algorithms and models to accurately detect and recognize text in images and videos from various scenes | 368
hszhao/psanet | A deep learning framework for semantic segmentation with spatial attention mechanisms | 216
jiwei0921/dmra | A Python implementation of a depth-induced multi-scale recurrent attention network for RGB-D saliency detection | 105
ibm/max-scene-classifier | An image classification model for recognizing physical places and locations | 41
pathak22/context-encoder | Unsupervised feature learning by image inpainting using Generative Adversarial Networks (GANs) | 885
damo-nlp-sg/vcd | An approach to reduce object hallucinations in large vision-language models by contrasting output distributions derived from original and distorted visual inputs | 209
danieljf24/w2vv | A deep neural network architecture that predicts visual features from text to improve image and video caption retrieval | 69
taokong/ron | A deep learning framework for object detection tasks using a novel neural network architecture | 355
rowanz/neural-motifs | A software framework for scene graph parsing with global context using PyTorch and Visual Genome data | 525
oyxhust/cnn-lstm-ctc-text-recognition | Develops CTC-based text recognition models with neural network architectures | 259
pistony/residualattentionnetwork | A Gluon implementation of Residual Attention Network for image classification tasks | 107
aaronshan/12306-captcha | Deep learning-based system to recognize and classify 12306 captcha images | 280