Scene-Text-Understanding

Text detection library

A research project focused on developing algorithms and models to accurately detect and recognize text in images and videos from various scenes.

OCR, Scene-Text-Understanding, Text Recognition

GitHub

368 stars
28 watching
112 forks
Language: C++
last commit: over 4 years ago

Related projects:

Repository Description Stars
tianzhi0549/ctpn Detects text in images using a neural network architecture 1,284
s3nh/text-detector A tool for detecting and translating text from images. 180
canjie-luo/moran_v2 A deep learning framework for scene text recognition with rectification and attention mechanisms. 636
yuhangzang/contextdet An approach to detecting objects in images using multimodal large language models and contextual information 202
2shou/textgrocery A text classification tool based on LibLinear with support for Chinese tokenize using jieba. 678
tahonermann/text_view A C++ library providing iterator and range-based interfaces for encoding and decoding strings in various character encodings. 122
kzykhys/text A simple text manipulation library with a fluent interface. 53
tensorflow/text Preprocessing and processing tools for text data in machine learning models 1,233
dinghanshen/swem A software project that implements word embedding-based models for text classification tasks and provides pre-trained embeddings and evaluation scripts. 284
peterc/whatlanguage Language detection library using Bloom filters for speed and memory efficiency. 685
jingzhang617/cod-rank-localize-and-segment Develops a system to detect, segment, and rank camouflaged objects in images. 74
bigbadbleucheese/kong A .NET library that identifies characteristics of web browsers by parsing their User-Agent header strings. 17
threedaymonk/text A collection of text algorithms and similarity measures 586
yknzhu/segdeepm A tool for fine-tuning deep neural networks to improve object detection and segmentation capabilities by incorporating contextual information. 27
sy-xuan/pink This project enables multi-modal language models to understand and generate text about visual content using referential comprehension. 76