Mengzi3

Language Model

An 8B and 13B language model based on the Llama architecture with multilingual capabilities.

GitHub

2k stars
73 watching
31 forks
Language: Python
last commit: 3 months ago

Related projects:

Repository Description Stars
langboat/mengzi Develops lightweight yet powerful pre-trained models for natural language processing tasks 533
bilibili/index-1.9b A lightweight, multilingual language model with a long context length 920
xverse-ai/xverse-13b A large language model developed to support multiple languages and applications 648
sail-sg/sailor-llm Develops language models tailored for South-East Asia's linguistic diversity and cultural nuances 120
xverse-ai/xverse-7b A multilingual large language model developed by XVERSE Technology Inc. 50
xverse-ai/xverse-moe-a36b Develops and publishes large multilingual language models with advanced mixing-of-experts architecture. 37
brightmart/xlnet_zh Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks 230
yunwentechnology/unilm This project provides pre-trained models and tools for natural language understanding (NLU) and generation (NLG) tasks in Chinese. 439
elanmart/psmm An implementation of a neural network model for character-level language modeling. 50
xverse-ai/xverse-65b A large language model developed by XVERSE Technology Inc. using transformer architecture and fine-tuned on diverse data sets for various applications. 132
linksoul-ai/chinese-llama-2-7b A deep learning project providing an open-source implementation of the LLaMA2 model with Chinese and English text data 2,235
flagai-open/aquila2 Provides pre-trained language models and tools for fine-tuning and evaluation 439
michael-wzhu/chinese-llama2 A custom Chinese version of the Meta Llama 2 model for improved Chinese language support and application 748
ieit-yuan/yuan2.0-m32 A high-performance language model designed to excel in tasks like natural language understanding, mathematical computation, and code generation 182
zhuiyitechnology/pretrained-models A collection of pre-trained language models for natural language processing tasks 989