attention-ocr
Image OCR model
A TensorFlow model for recognizing text in images using visual attention and a sequence-to-sequence architecture.
A TensorFlow model for text recognition (CNN + seq2seq with visual attention), available as a Python package and compatible with Google Cloud ML Engine.
1k stars
48 watching
258 forks
Language: Python
Last commit: about 1 year ago
Linked from 1 awesome list
cnn, google-cloud, google-cloud-ml, hacktoberfest, image-recognition, machine-learning, ml, ocr, ocr-recognition, seq2seq, tensorflow
Related projects:
Repository | Description | Stars |
---|---|---|
arunmichaeldsouza/tensorflow-image-detection | A tool for training and classifying images using Google's Inception model and TensorFlow | 328 |
preritj/segmentation | Deep learning models for semantic segmentation of images | 100 |
vladkryvoruchko/pspnet-keras-tensorflow | An implementation of a deep learning model for image segmentation using TensorFlow and Keras | 394 |
ibm/max-ocr | An optical character recognition system deployed as a web service using a trained Tesseract OCR model | 47 |
dinghanshen/swem | Word embedding-based models for text classification, with pre-trained embeddings and evaluation scripts | 284 |
tensorflow/text | Preprocessing and processing tools for text data in machine learning models | 1,233 |
oyxhust/cnn-lstm-ctc-text-recognition | Develops CTC-based text recognition models with neural network architectures | 259 |
isekai-portal/link-context-learning | An implementation of a multimodal learning approach to improve language models' ability to recognize unseen images and understand novel concepts. | 89 |
kwotsin/tensorflow-enet | A deep neural network implementation for real-time semantic segmentation in computer vision | 257 |
shekkizh/fcn.tensorflow | An implementation of a deep learning model for image segmentation using TensorFlow | 1,252 |
shubham-shahh/open-source-models | An archive of pre-trained computer vision models. | 61 |
tensorflow/tfjs | An open-source JavaScript library for training and deploying machine learning models using WebGL acceleration. | 18,495 |
hasnainraz/fc-densenet-tensorflow | Re-implementation of a 100-layer fully convolutional network architecture for image segmentation | 123 |
lxtgh/omg-seg | Develops an end-to-end model for multiple visual perception and reasoning tasks using a single encoder, decoder, and large language model. | 1,300 |