Scene-Text-Understanding
Text detection library
A research project focused on developing algorithms and models to accurately detect and recognize text in images and videos from various scenes.
OCR, Scene-Text-Understanding, Text Recognition
368 stars
28 watching
112 forks
Language: C++
last commit: over 4 years ago Related projects:
Repository | Description | Stars |
---|---|---|
tianzhi0549/ctpn | Detects text in images using a neural network architecture | 1,282 |
s3nh/text-detector | A tool for detecting and translating text from images. | 180 |
canjie-luo/moran_v2 | A deep learning framework for scene text recognition with rectification and attention mechanisms. | 639 |
yuhangzang/contextdet | An approach to detecting objects in images using multimodal large language models and contextual information | 208 |
2shou/textgrocery | A text classification tool based on LibLinear with support for Chinese tokenize using jieba. | 677 |
tahonermann/text_view | A C++ library providing iterator and range-based interfaces for encoding and decoding strings in various character encodings. | 122 |
kzykhys/text | A simple text manipulation library with a fluent interface. | 53 |
tensorflow/text | Preprocessing and processing tools for text data in machine learning models | 1,239 |
dinghanshen/swem | Reproduces the results of an ACL 2018 paper on simple word-embedding-based models for natural language processing tasks. | 284 |
peterc/whatlanguage | Language detection library using Bloom filters for speed and memory efficiency. | 685 |
jingzhang617/cod-rank-localize-and-segment | Develops a system to detect, segment, and rank camouflaged objects in images. | 74 |
bigbadbleucheese/kong | A .NET library that identifies characteristics of web browsers by parsing their User-Agent header strings. | 17 |
threedaymonk/text | A collection of text algorithms and similarity measures | 585 |
yknzhu/segdeepm | A tool for fine-tuning deep neural networks to improve object detection and segmentation capabilities by incorporating contextual information. | 27 |
sy-xuan/pink | This project enables multi-modal language models to understand and generate text about visual content using referential comprehension. | 79 |