WoBERT

Word-based Chinese Model

A word-based Chinese BERT model, initialized from an existing pre-trained model and further trained on large-scale text data

A Chinese BERT that uses the word as its basic unit
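WoBERT's defining trait is word-level tokenization: input text is pre-segmented into words (the repo uses jieba) before vocabulary lookup, falling back to finer-grained pieces for out-of-vocabulary words. Below is a minimal sketch of that setup, assuming the bert4keras library (which the project is built on) and a locally downloaded WoBERT checkpoint; the vocabulary path is a placeholder, not a path the repo guarantees:

```python
# Word-level tokenization in the style of WoBERT: pre-segment with jieba,
# then look the resulting words up in the checkpoint's vocabulary.
# Assumes bert4keras and jieba are installed; the path below is a placeholder.
import jieba
from bert4keras.tokenizers import Tokenizer

dict_path = "chinese_wobert_L-12_H-768_A-12/vocab.txt"  # placeholder checkpoint path

tokenizer = Tokenizer(
    dict_path,
    do_lower_case=True,
    # hand each sentence to jieba before vocabulary lookup, so whole
    # words become single tokens whenever they exist in the vocab
    pre_tokenize=lambda s: jieba.cut(s, HMM=False),
)

print(tokenizer.tokenize("今天天气不错"))  # word-level tokens
token_ids, segment_ids = tokenizer.encode("今天天气不错")
```

Because multi-character words map to single tokens, sequences come out shorter than with character-level Chinese BERT, which is the main efficiency argument the project makes.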

GitHub

460 stars
8 watching
70 forks
Language: Python
Last commit: about 3 years ago

Related projects:

| Repository | Description | Stars |
| --- | --- | --- |
| ymcui/macbert | Improves pre-trained Chinese language models by incorporating a correction task that alleviates the inconsistency between pre-training and downstream tasks | 646 |
| zhuiyitechnology/roformer-sim | An upgraded version of SimBERT with integrated retrieval and generation capabilities | 441 |
| brightmart/xlnet_zh | Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks | 230 |
| ethan-yt/guwenbert | A RoBERTa-based pre-trained language model for classical Chinese texts | 511 |
| ymcui/chinese-mobilebert | A Chinese implementation of the MobileBERT pre-trained language model | 81 |
| turkunlp/wikibert | Provides pre-trained language models derived from Wikipedia texts for natural language processing tasks | 34 |
| yunwentechnology/unilm | Pre-trained UniLM models and tools for Chinese natural language understanding (NLU) and generation (NLG) | 439 |
| zhuiyitechnology/pretrained-models | A collection of pre-trained language models for natural language processing tasks | 989 |
| tchayintr/thbert | A pre-trained BERT model designed to facilitate NLP research and development for the low-resource Thai language | 6 |
| zhuiyitechnology/t5-pegasus | A pre-trained Chinese text generation model trained on large-scale data | 558 |
| thunlp-aipoet/bert-ccpoem | A BERT-based pre-trained model for Chinese classical poetry | 146 |
| ymcui/pert | A pre-trained language model that learns semantic knowledge from permuted text without mask labels | 356 |
| ieit-yuan/yuan2.0-m32 | A high-performance language model designed for natural language understanding, mathematical computation, and code generation | 182 |
| cluebenchmark/cluepretrainedmodels | Pre-trained models for Chinese language tasks with improved performance and smaller model sizes than existing models | 806 |
| zhuiyitechnology/gau-alpha | An implementation of a transformer-based NLP model built on gated attention units | 98 |