Chinese-Transformer-XL
Chinese transformer model
A pre-trained Chinese language model based on the Transformer-XL architecture.
218 stars
10 watching
36 forks
Language: Python
Last commit: almost 2 years ago
Related projects:
Repository | Description | Stars |
---|---|---|
fastnlp/cpt | A pre-trained transformer model for natural language understanding and generation tasks in Chinese | 481 |
yunwentechnology/unilm | This project provides pre-trained models for natural language understanding and generation tasks using the UniLM architecture. | 438 |
hit-scir/chinese-mixtral-8x7b | An implementation of a large language model for Chinese text processing, built on a MoE (Mixture of Experts) architecture and incorporating a vast vocabulary. | 641 |
brightmart/xlnet_zh | Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks | 230 |
tencent/tencent-hunyuan-large | This project makes a large language model accessible for research and development | 1,114 |
zhuiyitechnology/roformer | An enhanced transformer model with improved relative position embeddings for natural language processing tasks | 819 |
tongjilibo/bert4torch | An implementation of transformer models in PyTorch for natural language processing tasks | 1,241 |
zhuiyitechnology/gau-alpha | An implementation of a Gated Attention Unit-based Transformer model for natural language processing tasks | 96 |
openai/finetune-transformer-lm | This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture. | 2,160 |
zhuiyitechnology/roformer-v2 | A faster and more effective text processing model based on the RoFormer architecture | 149 |
michael-wzhu/shennong-tcm-llm | Develops and deploys a large language model for traditional Chinese medicine applications | 299 |
cluebenchmark/cluepretrainedmodels | Provides pre-trained models for Chinese language tasks with improved performance and smaller model sizes compared to existing models. | 804 |
ymcui/chinese-xlnet | Provides pre-trained models for Chinese natural language processing tasks using the XLNet architecture | 1,653 |
tsinghuaai/cpm-1-generate | Provides tools and scripts for generating text using a pre-trained Chinese language model | 1,588 |
thudm/mathglm | A PyTorch-based implementation of a language model capable of performing arithmetic operations without calculators. | 318 |