Chinese-Mixtral-8x7B

Chinese Text Model

An implementation of a large language model for Chinese text processing, built on a Mixture-of-Experts (MoE) architecture and using an expanded Chinese vocabulary.

Chinese Mixtral-8x7B (Chinese-Mixtral-8x7B)

GitHub

641 stars
15 watching
32 forks
Language: Python
Last commit: 3 months ago
large-language-models, llm, mixtral-8x7b, nlp

Related projects:

Repository | Description | Stars
ymcui/chinese-mixtral | Develops and releases Mixtral-based models for natural language processing tasks with a focus on Chinese text generation and understanding | 584
brightmart/xlnet_zh | Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks | 230
felixgithub2017/mmcu | Evaluates the semantic understanding capabilities of large Chinese language models using a multimodal dataset | 87
cluebenchmark/cluepretrainedmodels | Provides pre-trained models for Chinese language tasks with improved performance and smaller model sizes compared to existing models | 804
thudm/chinese-transformer-xl | A pre-trained Chinese language model based on the Transformer-XL architecture | 218
hit-scir/semeval-2016 | A benchmarking dataset and evaluation framework for semantic dependency parsing in Chinese language texts | 135
pleisto/yuren-baichuan-7b | A multi-modal large language model that integrates natural language and visual capabilities with fine-tuning for various tasks | 72
ieit-yuan/yuan2.0-m32 | A high-performance language model designed to excel in tasks like natural language understanding, mathematical computation, and code generation | 180
yunwentechnology/unilm | Provides pre-trained models for natural language understanding and generation tasks using the UniLM architecture | 438
cluebenchmark/electra | Trains and evaluates a Chinese language model using adversarial training on a large corpus | 140
tencent/tencent-hunyuan-large | Makes a large language model accessible for research and development | 1,114
lonepatient/nezha_chinese_pytorch | An implementation of a Chinese language model using PyTorch and transformer architecture | 262
microsoft/unicoder | Provides pre-trained models and code for understanding and generation tasks in multiple languages | 88
ymcui/chinese-xlnet | Provides pre-trained models for Chinese natural language processing tasks using the XLNet architecture | 1,653
shawn-ieitsystems/yuan-1.0 | Large-scale language model with improved performance on NLP tasks through distributed training and efficient data processing | 591