Mengzi3
Language Model
An 8B and 13B language model based on the Llama architecture with multilingual capabilities.
2k stars
73 watching
31 forks
Language: Python
last commit: about 1 month ago Related projects:
Repository | Description | Stars |
---|---|---|
langboat/mengzi | Develops lightweight yet powerful pre-trained models for natural language processing tasks | 534 |
bilibili/index-1.9b | A lightweight, multilingual language model with a long context length | 904 |
xverse-ai/xverse-13b | A large language model developed to support multiple languages and applications | 649 |
sail-sg/sailor-llm | A collection of pre-trained language models designed to support the diverse linguistic needs of South-East Asia | 109 |
xverse-ai/xverse-7b | A multilingual large language model developed by XVERSE Technology Inc. | 50 |
xverse-ai/xverse-moe-a36b | Develops and publishes large multilingual language models with advanced mixing-of-experts architecture. | 36 |
brightmart/xlnet_zh | Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks | 230 |
yunwentechnology/unilm | This project provides pre-trained models for natural language understanding and generation tasks using the UniLM architecture. | 438 |
elanmart/psmm | An implementation of a neural network model for character-level language modeling. | 50 |
xverse-ai/xverse-65b | A large language model developed by XVERSE Technology Inc. using transformer architecture and fine-tuned on diverse data sets for various applications. | 132 |
linksoul-ai/chinese-llama-2-7b | A deep learning project providing an open-source implementation of the LLaMA2 model with Chinese and English text data | 2,228 |
flagai-open/aquila2 | Provides pre-trained language models and tools for fine-tuning and evaluation | 437 |
michael-wzhu/chinese-llama2 | A custom Chinese version of the Meta Llama 2 model for improved Chinese language support and application | 747 |
ieit-yuan/yuan2.0-m32 | A high-performance language model designed to excel in tasks like natural language understanding, mathematical computation, and code generation | 180 |
zhuiyitechnology/pretrained-models | A collection of pre-trained language models for natural language processing tasks | 987 |