Chinese-CLIP
A deep learning framework for cross-modal retrieval and representation generation using large-scale Chinese datasets
A Chinese version of CLIP that supports Chinese cross-modal retrieval and representation generation.
5k stars
35 watching
470 forks
Language: Python
Last commit: 4 months ago
Linked from 1 awesome list
chinese, clip, computer-vision, contrastive-loss, coreml-models, deep-learning, image-text-retrieval, multi-modal, multi-modal-learning, nlp, pretrained-models, pytorch, transformers, vision-and-language-pre-training, vision-language
Related projects:
Repository | Description | Stars |
---|---|---|
nvlabs/ffhq-dataset | A high-quality image dataset of human faces created to benchmark generative adversarial networks. | 3,746 |
tencentarc-qq/qa-clip | Provides Chinese language models with high performance for image-text retrieval and classification tasks. | 48 |
acccccccb/vue-img-cutter | An image cropping plugin for Vue.js applications | 489 |
openai/clip | A neural network trained on image and text pairs to predict the most relevant text snippet given an image | 25,919 |
tencentarc/gfpgan | An algorithm for restoring damaged or obscured faces in images | 35,898 |
rom1504/clip-retrieval | A tool for efficiently computing and utilizing CLIP embeddings for semantic search in multimodal data | 2,411 |
aaronshan/12306-captcha | Deep learning-based system to recognize and classify 12306 captcha images | 280 |
996refuse/zheye | A Python-based OCR system designed to recognize handwritten Chinese characters in vertical captcha images | 798 |
yongliang-wu/explorecfg | This project explores how varying configurations affect the performance of image captioning models | 27 |
wanshun123/facial-beauty-prediction | Web application that predicts facial beauty ratings from uploaded selfies using machine learning on a specific dataset. | 10 |
dog-qiuqiu/ultralight-simplepose | A lightweight human body posture key point model using computer vision and deep learning techniques. | 299 |
jin-s13/coco-wholebody | A large-scale benchmark and dataset for whole-body pose estimation in images | 762 |
sczhou/codeformer | A deep learning-based framework for enhancing and restoring images of faces in various conditions | 15,838 |
soloice/chinese-character-recognition | This project demonstrates how to build and train a convolutional neural network (CNN) to recognize Chinese characters. | 200 |
cszn/ircnn | This project trains deep CNN denoisers to improve image restoration tasks such as deblurring and demosaicking through model-based optimization methods. | 600 |