XVERSE-13B

Language Model

A large language model developed to support multiple languages and applications

XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.

GitHub

649 stars
18 watching
59 forks
Language: Python
last commit: 8 months ago

Related projects:

Repository Description Stars
xverse-ai/xverse-65b A large language model developed by XVERSE Technology Inc. using transformer architecture and fine-tuned on diverse data sets for various applications. 132
xverse-ai/xverse-moe-a36b Develops and publishes large multilingual language models with advanced mixing-of-experts architecture. 36
xverse-ai/xverse-7b A multilingual large language model developed by XVERSE Technology Inc. 50
xverse-ai/xverse-moe-a4.2b Developed by XVERSE Technology Inc. as a multilingual large language model with a unique mixture-of-experts architecture and fine-tuned for various tasks such as conversation, question answering, and natural language understanding. 36
xverse-ai/xverse-v-13b A large multimodal model for visual question answering, trained on a dataset of 2.1B image-text pairs and 8.2M instruction sequences. 77
langboat/mengzi3 An 8B and 13B language model based on the Llama architecture with multilingual capabilities. 2,032
ieit-yuan/yuan2.0-m32 A high-performance language model designed to excel in tasks like natural language understanding, mathematical computation, and code generation 180
shawn-ieitsystems/yuan-1.0 Large-scale language model with improved performance on NLP tasks through distributed training and efficient data processing 591
ibm-granite/granite-3.0-language-models A collection of lightweight state-of-the-art language models designed to support multilinguality, coding, and reasoning tasks on constrained resources. 214
microsoft/unicoder This repository provides pre-trained models and code for understanding and generation tasks in multiple languages. 88
zhuiyitechnology/roformer-sim An upgraded version of SimBERT model with integrated retrieval and generation capabilities 438
01-ai/yi A series of large language models trained from scratch to excel in multiple NLP tasks 7,699
bilibili/index-1.9b A lightweight, multilingual language model with a long context length 904
elanmart/psmm An implementation of a neural network model for character-level language modeling. 50
yunwentechnology/unilm This project provides pre-trained models for natural language understanding and generation tasks using the UniLM architecture. 438