FLAN
Language model tuner
A repository providing tools and datasets to fine-tune language models for specific tasks
1k stars
32 watching
156 forks
Language: Python
last commit: 27 days ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
openai/finetune-transformer-lm | This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture. | 2,160 |
openai/lm-human-preferences | Training methods and tools for fine-tuning language models using human preferences | 1,229 |
leks-forever/nllb-tuning | This is an experimental project for fine-tuning the NLB language model with a specific dataset and evaluating its performance on translation tasks. | 7 |
thunlp/ernie | A toolkit for fine-tuning pre-trained language models with knowledge graph representations to improve performance on entity typing and relation classification tasks. | 1,412 |
google-deepmind/recurrentgemma | An implementation of a fast and efficient language model architecture | 607 |
elanmart/psmm | An implementation of a neural network model for character-level language modeling. | 50 |
csuhan/onellm | A framework for training and fine-tuning multimodal language models on various data types | 588 |
ibm-granite/granite-3.0-language-models | A collection of lightweight state-of-the-art language models designed to support multilinguality, coding, and reasoning tasks on constrained resources. | 214 |
spandan-madan/pytorch_fine_tuning_tutorial | Provides guidance on fine-tuning pre-trained models for image classification tasks using PyTorch. | 279 |
ieit-yuan/yuan2.0-m32 | A high-performance language model designed to excel in tasks like natural language understanding, mathematical computation, and code generation | 180 |
deepset-ai/farm | An open-source framework for adapting representation models to various tasks and industries | 1,741 |
felixgithub2017/mmcu | Evaluates the semantic understanding capabilities of large Chinese language models using a multimodal dataset. | 87 |
facebookresearch/compilergym | A reinforcement learning environment library for compiler optimization tasks | 914 |
google-research/relay-policy-learning | Environments and data for training reinforcement learning agents in a kitchen simulator | 107 |
nvidia/sentiment-discovery | Large-scale unsupervised language modeling for robust sentiment classification and related NLP tasks | 1,062 |