lm-human-preferences

language model tuning

Training methods and tools for fine-tuning language models using human preferences

Code for the paper Fine-Tuning Language Models from Human Preferences

GitHub

1k stars
23 watching
163 forks
Language: Python
last commit: over 1 year ago

Related projects:

Repository Description Stars
openai/finetune-transformer-lm This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture. 2,160
flagai-open/aquila2 Provides pre-trained language models and tools for fine-tuning and evaluation 437
google-research/flan A repository providing tools and datasets to fine-tune language models for specific tasks 1,474
huggingface/pytorch-openai-transformer-lm Implementing OpenAI's transformer language model in PyTorch with pre-trained weights and fine-tuning capabilities 1,511
csuhan/onellm A framework for training and fine-tuning multimodal language models on various data types 588
vhellendoorn/code-lms A guide to using pre-trained large language models in source code analysis and generation 1,782
lge-arc-advancedai/auptimizer Automates model building and deployment process by optimizing hyperparameters and compressing models for edge computing. 200
bilibili/index-1.9b A lightweight, multilingual language model with a long context length 904
jshilong/gpt4roi Training and deploying large language models on computer vision tasks using region-of-interest inputs 506
luogen1996/lavin An open-source implementation of a vision-language instructed large language model 508
r2d4/openlm Library that provides a unified API to interact with various Large Language Models (LLMs) 366
ethanyanjiali/minchatgpt This project demonstrates the effectiveness of reinforcement learning from human feedback (RLHF) in improving small language models like GPT-2. 213
brightmart/xlnet_zh Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks 230
openai/generating-reviews-discovering-sentiment Generates reviews and discovers sentiment using a language model 1,510
apache/opennlp-models Provides pre-trained models for text processing in various languages 4