Chinese-Transformer-XL
Chinese transformer model
A pre-trained Chinese language model based on the Transformer-XL architecture.
218 stars
10 watching
36 forks
Language: Python
last commit: about 2 years ago Related projects:
Repository | Description | Stars |
---|---|---|
fastnlp/cpt | A pre-trained transformer model for natural language understanding and generation tasks in Chinese | 482 |
yunwentechnology/unilm | This project provides pre-trained models and tools for natural language understanding (NLU) and generation (NLG) tasks in Chinese. | 439 |
hit-scir/chinese-mixtral-8x7b | An implementation of a large language model for Chinese text processing, focusing on MoE (Multi-Headed Attention) architecture and incorporating a vast vocabulary. | 645 |
brightmart/xlnet_zh | Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks | 230 |
tencent/tencent-hunyuan-large | This project makes a large language model accessible for research and development | 1,245 |
zhuiyitechnology/roformer | An enhanced transformer model with improved relative position embeddings for natural language processing tasks | 837 |
tongjilibo/bert4torch | An implementation of transformer models in PyTorch for natural language processing tasks | 1,257 |
zhuiyitechnology/gau-alpha | An implementation of a transformer-based NLP model utilizing gated attention units | 98 |
openai/finetune-transformer-lm | This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture. | 2,167 |
zhuiyitechnology/roformer-v2 | An improved version of a transformer-based language model with enhanced speed and accuracy through structural simplification and pre-training | 148 |
michael-wzhu/shennong-tcm-llm | Develops and deploys a large language model for Chinese traditional medicine applications | 316 |
cluebenchmark/cluepretrainedmodels | Provides pre-trained models for Chinese language tasks with improved performance and smaller model sizes compared to existing models. | 806 |
ymcui/chinese-xlnet | Provides pre-trained models for Chinese natural language processing tasks using the XLNet architecture | 1,652 |
tsinghuaai/cpm-1-generate | Provides tools and scripts for generating text using a pre-trained Chinese language model | 1,588 |
thudm/mathglm | A Pytorch-based implementation of a language model capable of performing arithmetic operations without calculators. | 320 |