Unilm

Chinese NLU/NGL toolkit

This project provides pre-trained models and tools for natural language understanding (NLU) and generation (NLG) tasks in Chinese.

GitHub

439 stars
6 watching
86 forks
Language: Python
last commit: almost 3 years ago

Related projects:

Repository Description Stars
ieit-yuan/yuan2.0-m32 A high-performance language model designed to excel in tasks like natural language understanding, mathematical computation, and code generation 182
tsinghuaai/cpm Develops large-scale pre-trained models for Chinese natural language understanding and generative tasks with the goal of building efficient and effective models for various applications. 163
zhuiyitechnology/pretrained-models A collection of pre-trained language models for natural language processing tasks 989
ymcui/chinese-xlnet Provides pre-trained models for Chinese natural language processing tasks using the XLNet architecture 1,652
brightmart/xlnet_zh Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks 230
tencent/tencent-hunyuan-large This project makes a large language model accessible for research and development 1,245
shawn-ieitsystems/yuan-1.0 Large-scale language model with improved performance on NLP tasks through distributed training and efficient data processing 591
nkcs-iclab/linglong A pre-trained Chinese language model with a modest parameter count, designed to be accessible and useful for researchers with limited computing resources. 18
baai-wudao/model A repository of pre-trained language models for various tasks and domains. 121
cluebenchmark/cluepretrainedmodels Provides pre-trained models for Chinese language tasks with improved performance and smaller model sizes compared to existing models. 806
csuhan/onellm A framework for training and fine-tuning multimodal language models on various data types 601
thudm/chinese-transformer-xl A pre-trained Chinese language model based on the Transformer-XL architecture. 218
zhuiyitechnology/wobert A Word-based Chinese BERT model trained on large-scale text data using pre-trained models as a foundation 460
zhuiyitechnology/roformer-sim An upgraded version of SimBERT with integrated retrieval and generation capabilities 441
langboat/mengzi3 An 8B and 13B language model based on the Llama architecture with multilingual capabilities. 2,031