Chinese-Mixtral-8x7B
Chinese Text Model
An implementation of a large language model for Chinese text processing, focusing on MoE (Multi-Headed Attention) architecture and incorporating a vast vocabulary.
中文Mixtral-8x7B(Chinese-Mixtral-8x7B)
645 stars
15 watching
32 forks
Language: Python
last commit: 6 months ago large-language-modelsllmmixtral-8x7bnlp
Related projects:
Repository | Description | Stars |
---|---|---|
| Develops and releases Mixtral-based models for natural language processing tasks with a focus on Chinese text generation and understanding | 589 |
| Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks | 230 |
| Measures the understanding of massive multitask Chinese datasets using large language models | 87 |
| Provides pre-trained models for Chinese language tasks with improved performance and smaller model sizes compared to existing models. | 806 |
| A pre-trained Chinese language model based on the Transformer-XL architecture. | 218 |
| A benchmarking dataset and evaluation framework for semantic dependency parsing in Chinese language texts. | 135 |
| A multi-modal large language model that integrates natural language and visual capabilities with fine-tuning for various tasks | 73 |
| A high-performance language model designed to excel in tasks like natural language understanding, mathematical computation, and code generation | 182 |
| This project provides pre-trained models and tools for natural language understanding (NLU) and generation (NLG) tasks in Chinese. | 439 |
| Trains and evaluates a Chinese language model using adversarial training on a large corpus. | 140 |
| This project makes a large language model accessible for research and development | 1,245 |
| An implementation of a Chinese language model using PyTorch and transformer architecture. | 262 |
| This repository provides pre-trained models and code for understanding and generation tasks in multiple languages. | 89 |
| Provides pre-trained models for Chinese natural language processing tasks using the XLNet architecture | 1,652 |
| Large-scale language model with improved performance on NLP tasks through distributed training and efficient data processing | 591 |