FLAN

Language model tuner

A repository providing tools and datasets to fine-tune language models for specific tasks

1k stars

32 watching

156 forks

Language: Python

last commit: almost 2 years ago

Linked from 1 awesome list

Backlinks from these awesome lists:

yaodongc/awesome-instruction-dataset

Related projects:

Repository	Description	Stars
openai/finetune-transformer-lm	This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture.	2,167
openai/lm-human-preferences	Training methods and tools for fine-tuning language models using human preferences	1,240
leks-forever/nllb-tuning	This is an experimental project for fine-tuning the NLB language model with a specific dataset and evaluating its performance on translation tasks.	7
thunlp/ernie	A toolkit for fine-tuning pre-trained language models with knowledge graph representations to improve performance on entity typing and relation classification tasks.	1,413
google-deepmind/recurrentgemma	An implementation of a fast and efficient language model architecture	613
elanmart/psmm	An implementation of a neural network model for character-level language modeling.	50
csuhan/onellm	A framework for training and fine-tuning multimodal language models on various data types	601
ibm-granite/granite-3.0-language-models	A collection of lightweight state-of-the-art language models designed to support multilinguality, coding, and reasoning tasks on constrained resources.	232
spandan-madan/pytorch_fine_tuning_tutorial	Provides guidance on fine-tuning pre-trained models for image classification tasks using PyTorch.	279
ieit-yuan/yuan2.0-m32	A high-performance language model designed to excel in tasks like natural language understanding, mathematical computation, and code generation	182
deepset-ai/farm	An open-source framework for adapting representation models to various tasks and industries	1,743
felixgithub2017/mmcu	Measures the understanding of massive multitask Chinese datasets using large language models	87
facebookresearch/compilergym	A reinforcement learning environment library for compiler optimization tasks	917
google-research/relay-policy-learning	Environments and data for training reinforcement learning agents in a kitchen simulator	108
nvidia/sentiment-discovery	Large-scale unsupervised language modeling for robust sentiment classification and related NLP tasks	1,061