Yuan2.0-M32
Language Model
A high-performance language model designed to excel in tasks like natural language understanding, mathematical computation, and code generation
Mixture-of-Experts (MoE) Language Model
182 stars
3 watching
41 forks
Language: Python
last commit: over 1 year ago Related projects:
| Repository | Description | Stars |
|---|---|---|
| | Large-scale language model with improved performance on NLP tasks through distributed training and efficient data processing | 591 |
| | An open-source large language model framework for building conversational AI applications | 681 |
| | This project provides pre-trained models and tools for natural language understanding (NLU) and generation (NLG) tasks in Chinese. | 439 |
| | A collection of lightweight state-of-the-art language models designed to support multilinguality, coding, and reasoning tasks on constrained resources. | 232 |
| | A pre-trained language model designed to leverage linguistic features and outperform comparable baselines on Chinese natural language understanding tasks. | 202 |
| | A series of large language models trained from scratch to excel in multiple NLP tasks | 7,743 |
| | An implementation of a neural network model for character-level language modeling. | 50 |
| | Measures the understanding of massive multitask Chinese datasets using large language models | 87 |
| | Develops and publishes large multilingual language models with advanced mixing-of-experts architecture. | 37 |
| | A large language model developed to support multiple languages and applications | 648 |
| | An upgraded version of SimBERT with integrated retrieval and generation capabilities | 441 |
| | Large language model for dialogue support in multiple languages | 1,903 |
| | This project makes a large language model accessible for research and development | 1,245 |
| | A large language model developed by XVERSE Technology Inc. using transformer architecture and fine-tuned on diverse data sets for various applications. | 132 |
| | An updated version of a large language model designed to improve performance on multiple tasks and datasets | 13 |