MORAN_v2
Scene Text Recognizer
A deep learning framework for scene text recognition with rectification and attention mechanisms.
MORAN: A Multi-Object Rectified Attention Network for Scene Text Recognition
636 stars
24 watching
152 forks
Language: Python
last commit: 4 months ago
Tags: attention-mechanism, image-deformation, image-rectification, scene-text, scene-text-recognition
Related projects:
Repository | Description | Stars |
---|---|---|
andrewhou1/clps1520project | An implementation of a recurrent convolutional neural network for scene labeling | 5 |
csailvision/places365 | Provides pre-trained deep learning models for scene classification on the Places365 dataset | 1,926 |
sergioburdisso/pyss3 | A Python package implementing an interpretable machine learning model for text classification with visualization tools | 336 |
tangzhenyu/scene-text-understanding | A research project on algorithms and models for detecting and recognizing text in images and videos of varied scenes | 368 |
hszhao/psanet | A deep learning framework for semantic segmentation with spatial attention mechanisms | 216 |
jiwei0921/dmra | A Python implementation of a depth-induced multi-scale recurrent attention network for RGB-D saliency detection | 105 |
ibm/max-scene-classifier | An image classification model for recognizing physical places and locations | 41 |
pathak22/context-encoder | Unsupervised feature learning by image inpainting using Generative Adversarial Networks (GANs) | 885 |
damo-nlp-sg/vcd | An approach to reduce object hallucinations in large vision-language models by contrasting output distributions derived from original and distorted visual inputs | 209 |
danieljf24/w2vv | A deep neural network architecture that predicts visual features from text to improve image and video caption retrieval | 69 |
taokong/ron | A deep learning framework for object detection tasks using a novel neural network architecture | 355 |
rowanz/neural-motifs | A software framework for scene graph parsing with global context using PyTorch and Visual Genome data | 525 |
oyxhust/cnn-lstm-ctc-text-recognition | Develops CTC-based text recognition models with neural network architectures | 259 |
pistony/residualattentionnetwork | A Gluon implementation of Residual Attention Network for image classification tasks | 107 |
aaronshan/12306-captcha | Deep learning-based system to recognize and classify 12306 captcha images | 280 |