granite-3.0-language-models
Language models
A collection of lightweight state-of-the-art language models designed to support multilinguality, coding, and reasoning tasks on constrained resources.
214 stars
6 watching
20 forks
last commit: 13 days ago
Related projects:
Repository | Description | Stars |
---|---|---|
ieit-yuan/yuan2.0-m32 | A high-performance language model designed to excel in tasks such as natural language understanding, mathematical computation, and code generation. | 180 |
eleutherai/polyglot | Large language models designed to perform well in multiple languages and address performance issues with current multilingual models. | 475 |
elanmart/psmm | An implementation of a neural network model for character-level language modeling. | 50 |
google-deepmind/recurrentgemma | An implementation of a fast and efficient language model architecture. | 607 |
baai-wudao/model | A repository of pre-trained language models for various tasks and domains. | 121 |
turkunlp/wikibert | Provides pre-trained language models derived from Wikipedia texts for natural language processing tasks. | 34 |
bilibili/index-1.9b | A lightweight, multilingual language model with a long context length. | 904 |
xverse-ai/xverse-13b | A large language model developed to support multiple languages and applications. | 649 |
vhellendoorn/code-lms | A guide to using pre-trained large language models in source code analysis and generation. | 1,782 |
shawn-ieitsystems/yuan-1.0 | Large-scale language model with improved performance on NLP tasks through distributed training and efficient data processing. | 591 |
microsoft/unicoder | This repository provides pre-trained models and code for understanding and generation tasks in multiple languages. | 88 |
agrigpts/agrigpts | Developing agricultural large language models to support research and practical applications in agriculture. | 22 |
yunwentechnology/unilm | This project provides pre-trained models for natural language understanding and generation tasks using the UniLM architecture. | 438 |
felixgithub2017/mmcu | Evaluates the semantic understanding capabilities of large Chinese language models using a multimodal dataset. | 87 |
xverse-ai/xverse-65b | A large language model developed by XVERSE Technology Inc. using transformer architecture and fine-tuned on diverse data sets for various applications. | 132 |