 Chinese-Word-Vectors
 Word vectors
 Provides pre-trained vectors with various properties for downstream tasks in natural language processing
100+ Chinese Word Vectors (hundreds of pre-trained Chinese word vectors)
12k stars
 285 watching
 2k forks
 
Language: Python 
Last commit: almost 2 years ago
Topics: chinese, chinese-word-segmentation, embedding, embeddings, vectors-trained, word-embeddings
 Related projects:
| Repository | Description | Stars | 
|---|---|---|
|  | A software framework for learning Chinese word embeddings with stroke n-gram information | 274 | 
|  | A toolkit for learning high-quality word and text representations from ngram co-occurrence statistics | 848 | 
|  | Provides pre-trained models for Chinese language tasks with improved performance and smaller model sizes compared to existing models. | 806 | 
|  | Provides fast and efficient word embeddings for natural language processing. | 223 | 
|  | Trains and evaluates word embeddings for Chinese words, characters, and fine-grained subcharacter components. | 99 | 
|  | Improves word embeddings by training with adversarial objectives | 118 | 
|  | Tools and techniques for analyzing word meanings from word embeddings | 212 | 
|  | An NLP project offering various text classification models and techniques for deep learning exploration | 7,881 | 
|  | A large-scale Chinese corpus for pre-training language models. | 927 | 
|  | An implementation of a large language model for Chinese text processing, built on a MoE (Mixture of Experts) architecture and incorporating a large vocabulary. | 645 | 
|  | A fast and efficient utility package for utilizing vector embeddings in machine learning models | 1,635 | 
|  | Provides pre-trained word vectors for multiple languages to facilitate NLP tasks | 2,216 | 
|  | A deep learning model that generates word embeddings by predicting words based on their dependency context | 291 | 
|  | Provides training and testing code for a CNN-based sentence embedding model | 2 | 
|  | A collection of precomputed word embeddings for the Spanish language, derived from different corpora and computational methods. | 354 |