sailor-llm
Language models for SEA
A collection of pre-trained language models designed to support the diverse linguistic needs of South-East Asia
[EMNLP-2024] ⚓️ Sailor: Open Language Models for South-East Asia
109 stars
7 watching
9 forks
Language: Python
last commit: 3 months ago indonesialanguage-modellaomalayseathaivietnam
Related projects:
Repository | Description | Stars |
---|---|---|
seallms/seallms | Large language models designed to process languages commonly used in Southeast Asia | 8 |
langboat/mengzi3 | An 8B and 13B language model based on the Llama architecture with multilingual capabilities. | 2,032 |
ibm-granite/granite-3.0-language-models | A collection of lightweight state-of-the-art language models designed to support multilinguality, coding, and reasoning tasks on constrained resources. | 214 |
bilibili/index-1.9b | A lightweight, multilingual language model with a long context length | 904 |
elanmart/psmm | An implementation of a neural network model for character-level language modeling. | 50 |
vhellendoorn/code-lms | A guide to using pre-trained large language models in source code analysis and generation | 1,782 |
xverse-ai/xverse-7b | A multilingual large language model developed by XVERSE Technology Inc. | 50 |
orionstarai/orion | A family of large language models designed to handle multilingual text and provide strong performance in various tasks such as chat, long context, and retrieval augmented generation. | 785 |
yunwentechnology/unilm | This project provides pre-trained models for natural language understanding and generation tasks using the UniLM architecture. | 438 |
csuhan/onellm | A framework for training and fine-tuning multimodal language models on various data types | 588 |
nanbeige/nanbeige | Develops large language models for text understanding and generation tasks. | 85 |
academic-hammer/hammerllm | A large language model pre-trained on Chinese and English data, suitable for natural language processing tasks. | 43 |
ieit-yuan/yuan2.0-m32 | A high-performance language model designed to excel in tasks like natural language understanding, mathematical computation, and code generation | 180 |
eleutherai/polyglot | Large language models designed to perform well in multiple languages and address performance issues with current multilingual models. | 475 |
apache/opennlp-models | Distributes pre-trained models for natural language text processing tasks in various languages | 4 |