GAU-alpha

Transformer model

An implementation of a transformer-based NLP model utilizing gated attention units

A Transformer model based on the Gated Attention Unit (early-access version)

GitHub

98 stars
4 watching
9 forks
Language: Python
last commit: almost 2 years ago

Related projects:

| Repository | Description | Stars |
| --- | --- | --- |
| zhuiyitechnology/roformer | An enhanced transformer model with improved relative position embeddings for natural language processing tasks | 837 |
| openai/finetune-transformer-lm | This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture. | 2,167 |
| zhuiyitechnology/pretrained-models | A collection of pre-trained language models for natural language processing tasks | 989 |
| german-nlp-group/german-transformer-training | Trains German transformer models to improve language understanding | 23 |
| 01-ai/yi | A series of large language models trained from scratch to excel in multiple NLP tasks | 7,743 |
| tongjilibo/bert4torch | An implementation of transformer models in PyTorch for natural language processing tasks | 1,257 |
| zhuiyitechnology/roformer-v2 | An improved version of a transformer-based language model with enhanced speed and accuracy through structural simplification and pre-training | 148 |
| fastnlp/cpt | A pre-trained transformer model for natural language understanding and generation tasks in Chinese | 482 |
| ymcui/chinese-xlnet | Provides pre-trained models for Chinese natural language processing tasks using the XLNet architecture | 1,652 |
| zhuiyitechnology/wobert | A Word-based Chinese BERT model trained on large-scale text data using pre-trained models as a foundation | 460 |
| shawn-ieitsystems/yuan-1.0 | Large-scale language model with improved performance on NLP tasks through distributed training and efficient data processing | 591 |
| leviswind/pytorch-transformer | Implementation of a transformer-based translation model in PyTorch | 240 |
| huggingface/pytorch-openai-transformer-lm | Implementing OpenAI's transformer language model in PyTorch with pre-trained weights and fine-tuning capabilities | 1,511 |
| yunwentechnology/unilm | This project provides pre-trained models and tools for natural language understanding (NLU) and generation (NLG) tasks in Chinese. | 439 |
| thudm/chinese-transformer-xl | A pre-trained Chinese language model based on the Transformer-XL architecture. | 218 |