Chinese-Mixtral-8x7B

Chinese Text Model

An implementation of a large language model for Chinese text processing, focusing on MoE (Multi-Headed Attention) architecture and incorporating a vast vocabulary.

中文Mixtral-8x7B(Chinese-Mixtral-8x7B)

GitHub

645 stars
15 watching
32 forks
Language: Python
last commit: 5 months ago
large-language-modelsllmmixtral-8x7bnlp

Related projects:

Repository Description Stars
ymcui/chinese-mixtral Develops and releases Mixtral-based models for natural language processing tasks with a focus on Chinese text generation and understanding 589
brightmart/xlnet_zh Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks 230
felixgithub2017/mmcu Measures the understanding of massive multitask Chinese datasets using large language models 87
cluebenchmark/cluepretrainedmodels Provides pre-trained models for Chinese language tasks with improved performance and smaller model sizes compared to existing models. 806
thudm/chinese-transformer-xl A pre-trained Chinese language model based on the Transformer-XL architecture. 218
hit-scir/semeval-2016 A benchmarking dataset and evaluation framework for semantic dependency parsing in Chinese language texts. 135
pleisto/yuren-baichuan-7b A multi-modal large language model that integrates natural language and visual capabilities with fine-tuning for various tasks 73
ieit-yuan/yuan2.0-m32 A high-performance language model designed to excel in tasks like natural language understanding, mathematical computation, and code generation 182
yunwentechnology/unilm This project provides pre-trained models and tools for natural language understanding (NLU) and generation (NLG) tasks in Chinese. 439
cluebenchmark/electra Trains and evaluates a Chinese language model using adversarial training on a large corpus. 140
tencent/tencent-hunyuan-large This project makes a large language model accessible for research and development 1,245
lonepatient/nezha_chinese_pytorch An implementation of a Chinese language model using PyTorch and transformer architecture. 262
microsoft/unicoder This repository provides pre-trained models and code for understanding and generation tasks in multiple languages. 89
ymcui/chinese-xlnet Provides pre-trained models for Chinese natural language processing tasks using the XLNet architecture 1,652
shawn-ieitsystems/yuan-1.0 Large-scale language model with improved performance on NLP tasks through distributed training and efficient data processing 591