lm-human-preferences
language model tuning
Training methods and tools for fine-tuning language models using human preferences
Code for the paper Fine-Tuning Language Models from Human Preferences
1k stars
23 watching
163 forks
Language: Python
last commit: over 1 year ago Related projects:
Repository | Description | Stars |
---|---|---|
openai/finetune-transformer-lm | This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture. | 2,160 |
flagai-open/aquila2 | Provides pre-trained language models and tools for fine-tuning and evaluation | 437 |
google-research/flan | A repository providing tools and datasets to fine-tune language models for specific tasks | 1,474 |
huggingface/pytorch-openai-transformer-lm | Implementing OpenAI's transformer language model in PyTorch with pre-trained weights and fine-tuning capabilities | 1,511 |
csuhan/onellm | A framework for training and fine-tuning multimodal language models on various data types | 588 |
vhellendoorn/code-lms | A guide to using pre-trained large language models in source code analysis and generation | 1,782 |
lge-arc-advancedai/auptimizer | Automates model building and deployment process by optimizing hyperparameters and compressing models for edge computing. | 200 |
bilibili/index-1.9b | A lightweight, multilingual language model with a long context length | 904 |
jshilong/gpt4roi | Training and deploying large language models on computer vision tasks using region-of-interest inputs | 506 |
luogen1996/lavin | An open-source implementation of a vision-language instructed large language model | 508 |
r2d4/openlm | Library that provides a unified API to interact with various Large Language Models (LLMs) | 366 |
ethanyanjiali/minchatgpt | This project demonstrates the effectiveness of reinforcement learning from human feedback (RLHF) in improving small language models like GPT-2. | 213 |
brightmart/xlnet_zh | Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks | 230 |
openai/generating-reviews-discovering-sentiment | Generates reviews and discovers sentiment using a language model | 1,510 |
apache/opennlp-models | Provides pre-trained models for text processing in various languages | 4 |