Chinese-Mixtral-8x7B

Chinese Text Model

An implementation of a large language model for Chinese text processing, focusing on MoE (Multi-Headed Attention) architecture and incorporating a vast vocabulary.

中文Mixtral-8x7B（Chinese-Mixtral-8x7B）

GitHub

645 stars

15 watching

32 forks

Language: Python

last commit: 12 months ago

large-language-modelsllmmixtral-8x7bnlp

Related projects:

Repository	Description	Stars
ymcui/chinese-mixtral	Develops and releases Mixtral-based models for natural language processing tasks with a focus on Chinese text generation and understanding	589
brightmart/xlnet_zh	Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks	230
felixgithub2017/mmcu	Measures the understanding of massive multitask Chinese datasets using large language models	87
cluebenchmark/cluepretrainedmodels	Provides pre-trained models for Chinese language tasks with improved performance and smaller model sizes compared to existing models.	806
thudm/chinese-transformer-xl	A pre-trained Chinese language model based on the Transformer-XL architecture.	218
hit-scir/semeval-2016	A benchmarking dataset and evaluation framework for semantic dependency parsing in Chinese language texts.	135
pleisto/yuren-baichuan-7b	A multi-modal large language model that integrates natural language and visual capabilities with fine-tuning for various tasks	73
ieit-yuan/yuan2.0-m32	A high-performance language model designed to excel in tasks like natural language understanding, mathematical computation, and code generation	182
yunwentechnology/unilm	This project provides pre-trained models and tools for natural language understanding (NLU) and generation (NLG) tasks in Chinese.	439
cluebenchmark/electra	Trains and evaluates a Chinese language model using adversarial training on a large corpus.	140
tencent/tencent-hunyuan-large	This project makes a large language model accessible for research and development	1,245
lonepatient/nezha_chinese_pytorch	An implementation of a Chinese language model using PyTorch and transformer architecture.	262
microsoft/unicoder	This repository provides pre-trained models and code for understanding and generation tasks in multiple languages.	89
ymcui/chinese-xlnet	Provides pre-trained models for Chinese natural language processing tasks using the XLNet architecture	1,652
shawn-ieitsystems/yuan-1.0	Large-scale language model with improved performance on NLP tasks through distributed training and efficient data processing	591