ELECTRA
Chinese Language Model
Trains and evaluates a Chinese language model using adversarial training on a large corpus; a minimal sketch of the ELECTRA-style objective follows the project stats below.
Chinese pre-trained ELECTRA model: pre-training a Chinese model based on adversarial learning.
140 stars
9 watching
11 forks
last commit: about 5 years ago
Topics: adversarial-networks, albert, bert, electra, gan, language-model, pretrained-models
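The repository's own training code is not reproduced here. The following is a minimal, self-contained sketch of ELECTRA's replaced-token-detection objective, the core of this kind of adversarial pre-training: a small generator proposes replacement tokens at masked positions, and a discriminator labels each token as original or replaced. The module shapes, the 15% masking rate, and the toy batch are illustrative assumptions, not the project's actual configuration.

```python
import torch
import torch.nn as nn

vocab_size, hidden = 21128, 256  # 21128 ≈ Chinese BERT vocab size (assumption)

# Toy generator and discriminator; a real setup would use Transformer encoders.
generator = nn.Sequential(
    nn.Embedding(vocab_size, hidden),
    nn.Linear(hidden, vocab_size),
)
discriminator = nn.Sequential(
    nn.Embedding(vocab_size, hidden),
    nn.Linear(hidden, 1),
)

tokens = torch.randint(0, vocab_size, (2, 16))  # toy batch of token ids
mask = torch.rand(tokens.shape) < 0.15          # mask ~15% of positions

# Generator samples plausible replacements at the masked positions.
# (Sampling is non-differentiable; in ELECTRA the generator is trained
# with its own MLM loss, which is omitted in this sketch.)
gen_logits = generator(tokens)
sampled = torch.distributions.Categorical(logits=gen_logits).sample()
corrupted = torch.where(mask, sampled, tokens)

# Discriminator predicts, per token, whether it was replaced.
is_replaced = (corrupted != tokens).float()
disc_logits = discriminator(corrupted).squeeze(-1)
loss = nn.functional.binary_cross_entropy_with_logits(disc_logits, is_replaced)
loss.backward()
```

Because every token position yields a training signal (not just the masked ones, as in BERT-style MLM), this objective is markedly more sample-efficient, which is the usual motivation for the ELECTRA framework.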
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | Provides pre-trained models for Chinese language tasks with improved performance and smaller model sizes compared to existing models. | 806 |
| | A large-scale Chinese corpus for pre-training language models. | 927 |
| | Provides pre-trained Chinese language models based on the ELECTRA framework for natural language processing tasks (see the loading sketch after this table). | 1,405 |
| | A large language model supporting dialogue in multiple languages. | 1,903 |
| | An updated version of a large language model designed to improve performance on multiple tasks and datasets. | 13 |
| | Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks. | 230 |
| | A pre-trained language model for multiple natural language processing tasks with support for few-shot learning and transfer learning. | 656 |
| | A benchmarking platform for evaluating Chinese general-purpose models through anonymous, randomized battles. | 143 |
| | A deep learning model that incorporates visual and phonetic features of Chinese characters to better capture the nuances of the Chinese language. | 545 |
| | A massive multitask benchmark that measures the Chinese-language understanding of large language models. | 87 |
| | Provides pre-trained models and tools for natural language understanding (NLU) and generation (NLG) tasks in Chinese. | 439 |
| | Improves pre-trained Chinese language models by adding a correction-style pre-training task that reduces the mismatch with downstream tasks. | 646 |
| | A pre-trained language model for classical Chinese texts based on the RoBERTa architecture. | 511 |
| | A repository of pre-trained language models for various tasks and domains. | 121 |
| | An implementation of a large Chinese language model built on a Mixture-of-Experts (MoE) architecture with a very large vocabulary. | 645 |
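For the ELECTRA-based project in the table above, pre-trained checkpoints of this kind are typically consumed through the Hugging Face `transformers` library. The sketch below is a hypothetical loading example: the hub id `hfl/chinese-electra-base-discriminator` is an assumption about where such a discriminator might be published, not something stated by this repository.

```python
import torch
from transformers import AutoTokenizer, ElectraForPreTraining

model_id = "hfl/chinese-electra-base-discriminator"  # assumed hub id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = ElectraForPreTraining.from_pretrained(model_id)

# Score a short Chinese sentence; the discriminator head emits one logit
# per token indicating how likely that token is a replacement.
inputs = tokenizer("今天天气很好", return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits

print(torch.sigmoid(logits))  # per-token probability of "replaced"
```

For downstream classification or tagging tasks, the same checkpoint would normally be loaded with a task head instead (e.g. `ElectraForSequenceClassification`) and fine-tuned on labeled data.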