granite-3.0-language-models

Language models

A collection of lightweight, state-of-the-art language models designed to support multilingual, coding, and reasoning tasks in resource-constrained environments.

GitHub

232 stars
8 watching
22 forks
last commit: about 1 month ago

Related projects:

| Repository | Description | Stars |
| --- | --- | --- |
| ieit-yuan/yuan2.0-m32 | A high-performance language model designed to excel in tasks like natural language understanding, mathematical computation, and code generation | 182 |
| eleutherai/polyglot | Large language models designed to perform well in multiple languages and address performance issues with current multilingual models. | 476 |
| elanmart/psmm | An implementation of a neural network model for character-level language modeling. | 50 |
| google-deepmind/recurrentgemma | An implementation of a fast and efficient language model architecture | 613 |
| baai-wudao/model | A repository of pre-trained language models for various tasks and domains. | 121 |
| turkunlp/wikibert | Provides pre-trained language models derived from Wikipedia texts for natural language processing tasks | 34 |
| bilibili/index-1.9b | A lightweight, multilingual language model with a long context length | 920 |
| xverse-ai/xverse-13b | A large language model developed to support multiple languages and applications | 648 |
| vhellendoorn/code-lms | A guide to using pre-trained large language models in source code analysis and generation | 1,789 |
| shawn-ieitsystems/yuan-1.0 | Large-scale language model with improved performance on NLP tasks through distributed training and efficient data processing | 591 |
| microsoft/unicoder | This repository provides pre-trained models and code for understanding and generation tasks in multiple languages. | 89 |
| agrigpts/agrigpts | Developing large language models for agricultural applications to improve crop yields and support rural development. | 22 |
| yunwentechnology/unilm | This project provides pre-trained models and tools for natural language understanding (NLU) and generation (NLG) tasks in Chinese. | 439 |
| felixgithub2017/mmcu | Measures the understanding of massive multitask Chinese datasets using large language models | 87 |
| xverse-ai/xverse-65b | A large language model developed by XVERSE Technology Inc. using transformer architecture and fine-tuned on diverse data sets for various applications. | 132 |