attention-ocr

Image OCR model

A TensorFlow model for recognizing text in images using visual attention and a sequence-to-sequence architecture.

A Tensorflow model for text recognition (CNN + seq2seq with visual attention) available as a Python package and compatible with Google Cloud ML Engine.

GitHub

1k stars
48 watching
258 forks
Language: Python
last commit: about 1 year ago
Linked from 1 awesome list

cnngoogle-cloudgoogle-cloud-mlhacktoberfestimage-recognitionmachine-learningmlocrocr-recognitionseq2seqtensorflow

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
arunmichaeldsouza/tensorflow-image-detection A tool for training and classifying images using Google's Inception model and TensorFlow 328
preritj/segmentation Deep learning models for semantic segmentation of images 100
vladkryvoruchko/pspnet-keras-tensorflow An implementation of a deep learning model for image segmentation using TensorFlow and Keras 394
ibm/max-ocr An optical character recognition system deployed as a web service using a trained Tesseract OCR model 47
dinghanshen/swem A software project that implements word embedding-based models for text classification tasks and provides pre-trained embeddings and evaluation scripts. 284
tensorflow/text Preprocessing and processing tools for text data in machine learning models 1,233
oyxhust/cnn-lstm-ctc-text-recognition Develops CTC-based text recognition models with neural network architectures 259
isekai-portal/link-context-learning An implementation of a multimodal learning approach to improve language models' ability to recognize unseen images and understand novel concepts. 89
kwotsin/tensorflow-enet A deep neural network implementation for real-time semantic segmentation in computer vision 257
shekkizh/fcn.tensorflow An implementation of a deep learning model for image segmentation using TensorFlow 1,252
shubham-shahh/open-source-models An archive of pre-trained computer vision models. 61
tensorflow/tfjs An open-source JavaScript library for training and deploying machine learning models using WebGL acceleration. 18,495
hasnainraz/fc-densenet-tensorflow Re-implementation of a 100-layer fully convolutional network architecture for image segmentation 123
lxtgh/omg-seg Develops an end-to-end model for multiple visual perception and reasoning tasks using a single encoder, decoder, and large language model. 1,300