granite-3.0-language-models

Language models

A collection of lightweight state-of-the-art language models designed to support multilinguality, coding, and reasoning tasks on constrained resources.

GitHub

214 stars
6 watching
20 forks
last commit: 13 days ago

Related projects:

Repository Description Stars
ieit-yuan/yuan2.0-m32 A high-performance language model designed to excel in tasks like natural language understanding, mathematical computation, and code generation 180
eleutherai/polyglot Large language models designed to perform well in multiple languages and address performance issues with current multilingual models. 475
elanmart/psmm An implementation of a neural network model for character-level language modeling. 50
google-deepmind/recurrentgemma An implementation of a fast and efficient language model architecture 607
baai-wudao/model A repository of pre-trained language models for various tasks and domains. 121
turkunlp/wikibert Provides pre-trained language models derived from Wikipedia texts for natural language processing tasks 34
bilibili/index-1.9b A lightweight, multilingual language model with a long context length 904
xverse-ai/xverse-13b A large language model developed to support multiple languages and applications 649
vhellendoorn/code-lms A guide to using pre-trained large language models in source code analysis and generation 1,782
shawn-ieitsystems/yuan-1.0 Large-scale language model with improved performance on NLP tasks through distributed training and efficient data processing 591
microsoft/unicoder This repository provides pre-trained models and code for understanding and generation tasks in multiple languages. 88
agrigpts/agrigpts Developing agricultural large language models to support research and practical applications in agriculture. 22
yunwentechnology/unilm This project provides pre-trained models for natural language understanding and generation tasks using the UniLM architecture. 438
felixgithub2017/mmcu Evaluates the semantic understanding capabilities of large Chinese language models using a multimodal dataset. 87
xverse-ai/xverse-65b A large language model developed by XVERSE Technology Inc. using transformer architecture and fine-tuned on diverse data sets for various applications. 132