Mengzi3
Language Model
An 8B and 13B language model based on the Llama architecture with multilingual capabilities.
2k stars
73 watching
31 forks
Language: Python
last commit: about 1 year ago Related projects:
Repository | Description | Stars |
---|---|---|
| Develops lightweight yet powerful pre-trained models for natural language processing tasks | 533 |
| A lightweight, multilingual language model with a long context length | 920 |
| A large language model developed to support multiple languages and applications | 648 |
| Develops language models tailored for South-East Asia's linguistic diversity and cultural nuances | 120 |
| A multilingual large language model developed by XVERSE Technology Inc. | 50 |
| Develops and publishes large multilingual language models with advanced mixing-of-experts architecture. | 37 |
| Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks | 230 |
| This project provides pre-trained models and tools for natural language understanding (NLU) and generation (NLG) tasks in Chinese. | 439 |
| An implementation of a neural network model for character-level language modeling. | 50 |
| A large language model developed by XVERSE Technology Inc. using transformer architecture and fine-tuned on diverse data sets for various applications. | 132 |
| A deep learning project providing an open-source implementation of the LLaMA2 model with Chinese and English text data | 2,235 |
| Provides pre-trained language models and tools for fine-tuning and evaluation | 439 |
| A custom Chinese version of the Meta Llama 2 model for improved Chinese language support and application | 748 |
| A high-performance language model designed to excel in tasks like natural language understanding, mathematical computation, and code generation | 182 |
| A collection of pre-trained language models for natural language processing tasks | 989 |