XVERSE-MoE-A4.2B
Mixture-of-Experts Model
Developed by XVERSE Technology Inc. as a multilingual large language model with a unique mixture-of-experts architecture and fine-tuned for various tasks such as conversation, question answering, and natural language understanding.
XVERSE-MoE-A4.2B: A multilingual large language model developed by XVERSE Technology Inc.
36 stars
5 watching
6 forks
Language: Python
last commit: over 1 year ago Related projects:
| Repository | Description | Stars |
|---|---|---|
| | Develops and publishes large multilingual language models with advanced mixing-of-experts architecture. | 37 |
| | A large language model developed to support multiple languages and applications | 648 |
| | A large language model developed by XVERSE Technology Inc. using transformer architecture and fine-tuned on diverse data sets for various applications. | 132 |
| | A large multimodal model for visual question answering, trained on a dataset of 2.1B image-text pairs and 8.2M instruction sequences. | 78 |
| | A multilingual large language model developed by XVERSE Technology Inc. | 50 |
| | A large vision-language model using a mixture-of-experts architecture to improve performance on multi-modal learning tasks | 2,023 |
| | An open-source implementation of the Mixture-of-Embeddings-Experts model in Pytorch for video-text retrieval tasks. | 118 |
| | A high-performance language model designed to excel in tasks like natural language understanding, mathematical computation, and code generation | 182 |
| | A high-performance mixture-of-experts model with innovative training techniques for language processing tasks | 126 |
| | A Python package implementing an interpretable machine learning model for text classification with visualization tools | 336 |
| | A large language model with improved efficiency and performance compared to similar models | 1,024 |
| | A method for scaling multimodal large language models by combining multiple experts and fine-tuning them together | 136 |
| | Large-scale language model with improved performance on NLP tasks through distributed training and efficient data processing | 591 |
| | Develops large multimodal models for high-resolution understanding and analysis of text, images, and other data types. | 143 |
| | Improves performance of vision language tasks by integrating computer vision capabilities into large language models | 314 |