Unilm

Chinese NLU/NLG toolkit

This project provides pre-trained models and tools for natural language understanding (NLU) and generation (NLG) tasks in Chinese.

439 stars

6 watching

86 forks

Language: Python

Last commit: over 4 years ago

Related projects:

Repository	Description	Stars
ieit-yuan/yuan2.0-m32	A high-performance language model designed to excel in tasks like natural language understanding, mathematical computation, and code generation	182
tsinghuaai/cpm	Develops large-scale pre-trained models for Chinese natural language understanding and generation, with the goal of building efficient and effective models for various applications	163
zhuiyitechnology/pretrained-models	A collection of pre-trained language models for natural language processing tasks	989
ymcui/chinese-xlnet	Provides pre-trained models for Chinese natural language processing tasks using the XLNet architecture	1,652
brightmart/xlnet_zh	Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks	230
tencent/tencent-hunyuan-large	A large language model made accessible for research and development	1,245
shawn-ieitsystems/yuan-1.0	Large-scale language model with improved performance on NLP tasks through distributed training and efficient data processing	591
nkcs-iclab/linglong	A pre-trained Chinese language model with a modest parameter count, designed to be accessible and useful for researchers with limited computing resources	18
baai-wudao/model	A repository of pre-trained language models for various tasks and domains	121
cluebenchmark/cluepretrainedmodels	Provides pre-trained models for Chinese language tasks with improved performance and smaller model sizes compared to existing models	806
csuhan/onellm	A framework for training and fine-tuning multimodal language models on various data types	601
thudm/chinese-transformer-xl	A pre-trained Chinese language model based on the Transformer-XL architecture	218
zhuiyitechnology/wobert	A word-based Chinese BERT model trained on large-scale text data using pre-trained models as a foundation	460
zhuiyitechnology/roformer-sim	An upgraded version of SimBERT with integrated retrieval and generation capabilities	441
langboat/mengzi3	8B and 13B language models based on the Llama architecture with multilingual capabilities	2,031