Chinese-Mixtral-8x7B
Chinese Text Model
An implementation of a large language model for Chinese text processing, focusing on MoE (Multi-Headed Attention) architecture and incorporating a vast vocabulary.
中文Mixtral-8x7B(Chinese-Mixtral-8x7B)
641 stars
15 watching
32 forks
Language: Python
last commit: 3 months ago large-language-modelsllmmixtral-8x7bnlp
Related projects:
Repository | Description | Stars |
---|---|---|
ymcui/chinese-mixtral | Develops and releases Mixtral-based models for natural language processing tasks with a focus on Chinese text generation and understanding | 584 |
brightmart/xlnet_zh | Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks | 230 |
felixgithub2017/mmcu | Evaluates the semantic understanding capabilities of large Chinese language models using a multimodal dataset. | 87 |
cluebenchmark/cluepretrainedmodels | Provides pre-trained models for Chinese language tasks with improved performance and smaller model sizes compared to existing models. | 804 |
thudm/chinese-transformer-xl | A pre-trained Chinese language model based on the Transformer-XL architecture. | 218 |
hit-scir/semeval-2016 | A benchmarking dataset and evaluation framework for semantic dependency parsing in Chinese language texts. | 135 |
pleisto/yuren-baichuan-7b | A multi-modal large language model that integrates natural language and visual capabilities with fine-tuning for various tasks | 72 |
ieit-yuan/yuan2.0-m32 | A high-performance language model designed to excel in tasks like natural language understanding, mathematical computation, and code generation | 180 |
yunwentechnology/unilm | This project provides pre-trained models for natural language understanding and generation tasks using the UniLM architecture. | 438 |
cluebenchmark/electra | Trains and evaluates a Chinese language model using adversarial training on a large corpus. | 140 |
tencent/tencent-hunyuan-large | This project makes a large language model accessible for research and development | 1,114 |
lonepatient/nezha_chinese_pytorch | An implementation of a Chinese language model using PyTorch and transformer architecture. | 262 |
microsoft/unicoder | This repository provides pre-trained models and code for understanding and generation tasks in multiple languages. | 88 |
ymcui/chinese-xlnet | Provides pre-trained models for Chinese natural language processing tasks using the XLNet architecture | 1,653 |
shawn-ieitsystems/yuan-1.0 | Large-scale language model with improved performance on NLP tasks through distributed training and efficient data processing | 591 |