VLGuard

Safety fine-tuner

Improves the safety and helpfulness of vision large language models by fine-tuning them on safety-critical tasks

[ICML 2024] Safety Fine-Tuning at (Almost) No Cost: A Baseline for Vision Large Language Models.

GitHub

45 stars
3 watching
1 fork
Language: Python
Last commit: 3 months ago
Topics: alignment, large-language-models, large-vision-language-models, safety, vision-language-model

Related projects:

Repository | Description | Stars
roboflow/maestro | A tool to streamline fine-tuning of multimodal models for vision-language tasks | 1,386
ys-zong/vl-icl | A benchmarking suite for multimodal in-context learning models | 28
icoz69/stablellava | A tool for generating and evaluating multimodal large language models with visual instruction tuning capabilities | 91
x-plug/cvalues | Evaluates and aligns the values of Chinese large language models with safety and responsibility standards | 477
rlhf-v/rlhf-v | Aligns large language models' behavior through fine-grained correctional human feedback to improve trustworthiness and accuracy | 233
ucsc-vlaa/vllm-safety-benchmark | A benchmark for evaluating the safety and robustness of vision language models against adversarial attacks | 67
yfzhang114/llava-align | Debiasing techniques to minimize hallucinations in large visual language models | 71
liaoning97/revo-lion | A comprehensive dataset and evaluation framework for vision-language instruction tuning models | 11
ymcui/macbert | Improves pre-trained Chinese language models by incorporating a correction task to alleviate inconsistency issues with downstream tasks | 645
yiyangzhou/lure | Analyzes and mitigates object hallucination in large vision-language models to improve their accuracy and reliability | 134
spandan-madan/pytorch_fine_tuning_tutorial | Provides guidance on fine-tuning pre-trained models for image classification tasks using PyTorch | 279
pku-alignment/safety-gymnasium | A unified benchmark for safe reinforcement learning algorithms and environments | 394
pku-yuangroup/languagebind | Extends pretraining models to handle multiple modalities by aligning language and video representations | 723
jerry1993-tech/cornucopia-llama-fin-chinese | A Chinese finance-focused large language model fine-tuning framework | 589
brightmart/xlnet_zh | Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks | 230