Chinese-CLIP

A deep learning framework for cross-modal retrieval and representation generation using large-scale Chinese datasets

A Chinese version of CLIP that supports Chinese cross-modal retrieval and representation generation.

5k stars
35 watching
470 forks
Language: Python
last commit: 4 months ago
Linked from 1 awesome list

Topics: chinese, clip, computer-vision, contrastive-loss, coreml-models, deep-learning, image-text-retrieval, multi-modal, multi-modal-learning, nlp, pretrained-models, pytorch, transformers, vision-and-language-pre-training, vision-language

Related projects:

Repository | Description | Stars
nvlabs/ffhq-dataset | A high-quality image dataset of human faces created to benchmark generative adversarial networks | 3,746
tencentarc-qq/qa-clip | Provides Chinese language models with high performance for image-text retrieval and classification tasks | 48
acccccccb/vue-img-cutter | An image cropping plugin for Vue.js applications | 489
openai/clip | A neural network trained on image and text pairs to predict the most relevant text snippet given an image | 25,919
tencentarc/gfpgan | An algorithm for restoring damaged or obscured faces in images | 35,898
rom1504/clip-retrieval | A tool for efficiently computing and utilizing CLIP embeddings for semantic search in multimodal data | 2,411
aaronshan/12306-captcha | Deep learning-based system to recognize and classify 12306 captcha images | 280
996refuse/zheye | A Python-based OCR system designed to recognize handwritten Chinese characters in vertical captcha images | 798
yongliang-wu/explorecfg | Explores how varying configurations affect the performance of image captioning models | 27
wanshun123/facial-beauty-prediction | Web application that predicts facial beauty ratings from uploaded selfies using machine learning on a specific dataset | 10
dog-qiuqiu/ultralight-simplepose | A lightweight human body posture key point model using computer vision and deep learning techniques | 299
jin-s13/coco-wholebody | A large-scale benchmark and dataset for whole-body pose estimation in images | 762
sczhou/codeformer | A deep learning-based framework for enhancing and restoring images of faces in various conditions | 15,838
soloice/chinese-character-recognition | Demonstrates how to build and train a convolutional neural network (CNN) to recognize Chinese characters | 200
cszn/ircnn | Trains deep CNN denoisers to improve image restoration tasks such as deblurring and demosaicking through model-based optimization methods | 600