alignment-handbook

Alignment Recipes

Provides recipes and guidelines for training language models to align with human preferences and AI goals

Robust recipes to align language models with human and AI preferences

GitHub

5k stars

112 watching

417 forks

Language: Python

last commit: 8 months ago

llm, rlhf, transformers

huggingface.co/HuggingFaceH4

Related projects:

Repository	Description	Stars
zjh-819/llmdatahub	A curated collection of high-quality datasets for training large language models.	2,708
thunlp/promptpapers	A curated list of papers on prompt-based tuning for pre-trained language models, providing insights and advancements in the field.	4,112
ethanyanjiali/minchatgpt	This project demonstrates the effectiveness of reinforcement learning from human feedback (RLHF) in improving small language models like GPT-2.	214
openai/lm-human-preferences	Training methods and tools for fine-tuning language models using human preferences	1,240
huggingface/trl	A library designed to train transformer language models with reinforcement learning using various optimization techniques and fine-tuning methods.	10,308
huggingface/lerobot	A platform providing pre-trained models, datasets, and tools for robotics with focus on imitation learning and reinforcement learning.	7,874
huggingface/peft	An efficient method for fine-tuning large pre-trained models by adapting only a small fraction of their parameters	16,699
haotian-liu/llava	A system that uses large language and vision models to generate and process visual instructions	20,683
stability-ai/stablelm	Develops and maintains large language models with improved stability and performance	15,829
dair-ai/ml-papers-explained	An explanation of key concepts and advancements in the field of Machine Learning	7,352
huggingface/transformers	A collection of pre-trained machine learning models for various natural language and computer vision tasks, enabling developers to fine-tune and deploy these models on their own projects.	136,357
haifengl/smile	A comprehensive machine learning framework that provides a wide range of algorithms and data structures for tasks such as classification, regression, clustering, and visualization.	6,066
instruction-tuning-with-gpt-4/gpt-4-llm	This project generates instruction-following data using GPT-4 to fine-tune large language models for real-world tasks.	4,244
huggingface/text-generation-inference	A toolkit for deploying and serving Large Language Models (LLMs) for high-performance text generation	9,456
microsoft/flaml	Automates machine learning workflows and optimizes model performance using large language models and efficient algorithms	3,968