Yuan2.0-M32

Language Model

A high-performance language model designed to excel in tasks like natural language understanding, mathematical computation, and code generation

Mixture-of-Experts (MoE) Language Model

GitHub

182 stars
3 watching
41 forks
Language: Python
last commit: over 1 year ago

Related projects:

Repository Description Stars
shawn-ieitsystems/yuan-1.0 Large-scale language model with improved performance on NLP tasks through distributed training and efficient data processing 591
ieit-yuan/yuan-2.0 An open-source large language model framework for building conversational AI applications 681
yunwentechnology/unilm This project provides pre-trained models and tools for natural language understanding (NLU) and generation (NLG) tasks in Chinese. 439
ibm-granite/granite-3.0-language-models A collection of lightweight state-of-the-art language models designed to support multilinguality, coding, and reasoning tasks on constrained resources. 232
ymcui/lert A pre-trained language model designed to leverage linguistic features and outperform comparable baselines on Chinese natural language understanding tasks. 202
01-ai/yi A series of large language models trained from scratch to excel in multiple NLP tasks 7,743
elanmart/psmm An implementation of a neural network model for character-level language modeling. 50
felixgithub2017/mmcu Measures the understanding of massive multitask Chinese datasets using large language models 87
xverse-ai/xverse-moe-a36b Develops and publishes large multilingual language models with advanced mixing-of-experts architecture. 37
xverse-ai/xverse-13b A large language model developed to support multiple languages and applications 648
zhuiyitechnology/roformer-sim An upgraded version of SimBERT with integrated retrieval and generation capabilities 441
clue-ai/chatyuan Large language model for dialogue support in multiple languages 1,903
tencent/tencent-hunyuan-large This project makes a large language model accessible for research and development 1,245
xverse-ai/xverse-65b A large language model developed by XVERSE Technology Inc. using transformer architecture and fine-tuned on diverse data sets for various applications. 132
clue-ai/chatyuan-7b An updated version of a large language model designed to improve performance on multiple tasks and datasets 13