FLAN

Language model tuner

A repository providing tools and datasets to fine-tune language models for specific tasks

GitHub

1k stars
32 watching
156 forks
Language: Python
last commit: 27 days ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
openai/finetune-transformer-lm This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture. 2,160
openai/lm-human-preferences Training methods and tools for fine-tuning language models using human preferences 1,229
leks-forever/nllb-tuning This is an experimental project for fine-tuning the NLB language model with a specific dataset and evaluating its performance on translation tasks. 7
thunlp/ernie A toolkit for fine-tuning pre-trained language models with knowledge graph representations to improve performance on entity typing and relation classification tasks. 1,412
google-deepmind/recurrentgemma An implementation of a fast and efficient language model architecture 607
elanmart/psmm An implementation of a neural network model for character-level language modeling. 50
csuhan/onellm A framework for training and fine-tuning multimodal language models on various data types 588
ibm-granite/granite-3.0-language-models A collection of lightweight state-of-the-art language models designed to support multilinguality, coding, and reasoning tasks on constrained resources. 214
spandan-madan/pytorch_fine_tuning_tutorial Provides guidance on fine-tuning pre-trained models for image classification tasks using PyTorch. 279
ieit-yuan/yuan2.0-m32 A high-performance language model designed to excel in tasks like natural language understanding, mathematical computation, and code generation 180
deepset-ai/farm An open-source framework for adapting representation models to various tasks and industries 1,741
felixgithub2017/mmcu Evaluates the semantic understanding capabilities of large Chinese language models using a multimodal dataset. 87
facebookresearch/compilergym A reinforcement learning environment library for compiler optimization tasks 914
google-research/relay-policy-learning Environments and data for training reinforcement learning agents in a kitchen simulator 107
nvidia/sentiment-discovery Large-scale unsupervised language modeling for robust sentiment classification and related NLP tasks 1,062