attention-ocr

Image OCR model

A TensorFlow model for recognizing text in images using visual attention and a sequence-to-sequence architecture.

A Tensorflow model for text recognition (CNN + seq2seq with visual attention) available as a Python package and compatible with Google Cloud ML Engine.

GitHub

1k stars

48 watching

256 forks

Language: Python

last commit: over 1 year ago

Linked from 1 awesome list

cnngoogle-cloudgoogle-cloud-mlhacktoberfestimage-recognitionmachine-learningmlocrocr-recognitionseq2seqtensorflow

Backlinks from these awesome lists:

kba/awesome-ocr

Related projects:

Repository	Description	Stars
arunmichaeldsouza/tensorflow-image-detection	A tool for training and classifying images using Google's Inception model and TensorFlow	329
preritj/segmentation	Deep learning models for semantic segmentation of images	101
vladkryvoruchko/pspnet-keras-tensorflow	An implementation of a deep learning model for image segmentation using TensorFlow and Keras	394
ibm/max-ocr	An optical character recognition system deployed as a web service using a trained Tesseract OCR model	47
dinghanshen/swem	Reproduces the results of an ACL 2018 paper on simple word-embedding-based models for natural language processing tasks.	284
tensorflow/text	Preprocessing and processing tools for text data in machine learning models	1,239
oyxhust/cnn-lstm-ctc-text-recognition	Develops CTC-based text recognition models with neural network architectures	259
isekai-portal/link-context-learning	An implementation of a multimodal learning approach to improve language models' ability to recognize unseen images and understand novel concepts.	91
kwotsin/tensorflow-enet	A deep neural network implementation for real-time semantic segmentation in computer vision	257
shekkizh/fcn.tensorflow	An implementation of a deep learning model for image segmentation using TensorFlow	1,251
shubham-shahh/open-source-models	An archive of pre-trained computer vision models.	62
tensorflow/tfjs	An open-source JavaScript library for training and deploying machine learning models using WebGL acceleration.	18,533
hasnainraz/fc-densenet-tensorflow	Re-implementation of a 100-layer fully convolutional network architecture for image segmentation	123
lxtgh/omg-seg	Develops an end-to-end model for multiple visual perception and reasoning tasks using a single encoder, decoder, and large language model.	1,336