recognize-anything
Image recognition library
Open-source foundation models for image recognition, built for high accuracy and flexibility.
3k stars
28 watching
279 forks
Language: Jupyter Notebook
Last commit: 7 months ago
Topics: recognize-anything, tag2text-iclr2024
Related projects:
| Description | Stars |
|---|---|
| This project provides code and tools for running inference with a visual segmentation model that can generate object masks from input prompts. | 48,092 |
| An archive of pre-trained computer vision models. | 62 |
| Named Entity Recognition model using LSTM and CRF with character embeddings | 1,947 |
| Deep learning-based system to recognize and classify 12306 captcha images | 281 |
| A multi-modal AI model developed for improved instruction-following and in-context learning, utilizing large-scale architectures and various training datasets. | 3,570 |
| A large multi-modal model developed using the Llama3 language model, designed to improve image understanding capabilities. | 32 |
| Provides a large multi-label image database and pre-trained ResNet model for computer vision tasks | 3,055 |
| A large vision language model with improved image reasoning and text recognition capabilities, suitable for various multimodal tasks | 5,179 |
| A deep learning model for fast object segmentation | 7,575 |
| A tool for annotating images with tags and categories | 134 |
| A TensorFlow model for recognizing text in images using visual attention and a sequence-to-sequence architecture. | 1,079 |
| An open-source toolkit for training and deploying large-scale AI models on various downstream tasks with multi-modality | 3,840 |
| Provides a benchmarking framework and implementation for deep learning-based text recognition models | 3,769 |
| An NLP project offering various text classification models and techniques for deep learning exploration | 7,881 |
| A toolkit for building and deploying deep learning models in computer vision | 5,850 |