LERT

Linguistic Model

A pre-trained language model that incorporates linguistic features and is reported to outperform comparable baselines on Chinese natural language understanding tasks.

LERT: A Linguistically-motivated Pre-trained Language Model (LERT, a pre-trained model enhanced with linguistic information)

GitHub

202 stars

3 watching

15 forks

Language: Python

last commit: over 2 years ago

Topics: bert, lert, nlp, plm, pre-train, pytorch, tensorflow, transformer

arxiv.org/abs/2211.05344

Related projects:

Repository	Description	Stars
ymcui/pert	Develops a pre-trained language model to learn semantic knowledge from permuted text without mask labels	356
ymcui/macbert	Improves pre-trained Chinese language models by incorporating a correction task to alleviate inconsistency issues with downstream tasks	646
ymcui/chinese-xlnet	Provides pre-trained models for Chinese natural language processing tasks using the XLNet architecture	1,652
ieit-yuan/yuan2.0-m32	A high-performance language model designed to excel in tasks like natural language understanding, mathematical computation, and code generation	182
ymcui/chinese-mobilebert	A Python implementation of MobileBERT, a pre-trained language model, for NLP tasks	81
ymcui/chinese-electra	Provides pre-trained Chinese language models based on the ELECTRA framework for natural language processing tasks	1,405
brightmart/xlnet_zh	Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks	230
zhuiyitechnology/pretrained-models	A collection of pre-trained language models for natural language processing tasks	989
ymcui/chinese-mixtral	Develops and releases Mixtral-based models for natural language processing tasks with a focus on Chinese text generation and understanding	589
yunwentechnology/unilm	Provides pre-trained models and tools for Chinese natural language understanding (NLU) and generation (NLG) tasks	439
bilibili/index-1.9b	A lightweight, multilingual language model with a long context length	920
nkcs-iclab/linglong	A pre-trained Chinese language model with a modest parameter count, designed to be accessible to researchers with limited computing resources	18
vhellendoorn/code-lms	A guide to using pre-trained large language models in source code analysis and generation	1,789
yuangongnd/ltu	An audio and speech large language model implementation with pre-trained models, datasets, and inference options	396
zhuiyitechnology/wobert	A word-based Chinese BERT model trained on large-scale text data, using pre-trained models as a foundation	460