Chinese-Transformer-XL

Chinese transformer model

A pre-trained Chinese language model based on the Transformer-XL architecture.

GitHub

218 stars
10 watching
36 forks
Language: Python
last commit: almost 2 years ago

Related projects:

Repository Description Stars
fastnlp/cpt A pre-trained transformer model for natural language understanding and generation tasks in Chinese 481
yunwentechnology/unilm This project provides pre-trained models for natural language understanding and generation tasks using the UniLM architecture. 438
hit-scir/chinese-mixtral-8x7b An implementation of a large language model for Chinese text processing, focusing on MoE (Multi-Headed Attention) architecture and incorporating a vast vocabulary. 641
brightmart/xlnet_zh Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks 230
tencent/tencent-hunyuan-large This project makes a large language model accessible for research and development 1,114
zhuiyitechnology/roformer An enhanced transformer model with improved relative position embeddings for natural language processing tasks 819
tongjilibo/bert4torch An implementation of transformer models in PyTorch for natural language processing tasks 1,241
zhuiyitechnology/gau-alpha An implementation of a Gated Attention Unit-based Transformer model for natural language processing tasks 96
openai/finetune-transformer-lm This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture. 2,160
zhuiyitechnology/roformer-v2 A faster and more effective text processing model based on the RoFormer architecture 149
michael-wzhu/shennong-tcm-llm Develops and deploys a large language model for Chinese traditional medicine applications 299
cluebenchmark/cluepretrainedmodels Provides pre-trained models for Chinese language tasks with improved performance and smaller model sizes compared to existing models. 804
ymcui/chinese-xlnet Provides pre-trained models for Chinese natural language processing tasks using the XLNet architecture 1,653
tsinghuaai/cpm-1-generate Provides tools and scripts for generating text using a pre-trained Chinese language model 1,588
thudm/mathglm A Pytorch-based implementation of a language model capable of performing arithmetic operations without calculators. 318