VLGuard
Safety fine-tuner
Improves safety and helpfulness of large language models by fine-tuning them using safety-critical tasks
[ICML 2024] Safety Fine-Tuning at (Almost) No Cost: A Baseline for Vision Large Language Models.
47 stars
3 watching
2 forks
Language: Python
last commit: 5 months ago alignmentlarge-language-modelslarge-vision-language-modelssafetyvision-language-model
Related projects:
Repository | Description | Stars |
---|---|---|
roboflow/maestro | A tool to streamline fine-tuning of multimodal models for vision-language tasks | 1,415 |
ys-zong/vl-icl | A benchmarking suite for multimodal in-context learning models | 31 |
icoz69/stablellava | A tool for generating and evaluating multimodal Large Language Models with visual instruction tuning capabilities | 93 |
x-plug/cvalues | Evaluates and aligns the values of Chinese large language models with safety and responsibility standards | 481 |
rlhf-v/rlhf-v | Aligns large language models' behavior through fine-grained correctional human feedback to improve trustworthiness and accuracy. | 245 |
ucsc-vlaa/vllm-safety-benchmark | A benchmark for evaluating the safety and robustness of vision language models against adversarial attacks. | 72 |
yfzhang114/llava-align | Debiasing techniques to minimize hallucinations in large visual language models | 75 |
liaoning97/revo-lion | A comprehensive dataset and evaluation framework for Vision-Language Instruction Tuning models | 11 |
ymcui/macbert | Improves pre-trained Chinese language models by incorporating a correction task to alleviate inconsistency issues with downstream tasks | 646 |
yiyangzhou/lure | Analyzing and mitigating object hallucination in large vision-language models to improve their accuracy and reliability. | 136 |
spandan-madan/pytorch_fine_tuning_tutorial | Provides guidance on fine-tuning pre-trained models for image classification tasks using PyTorch. | 279 |
pku-alignment/safety-gymnasium | A unified benchmark for safe reinforcement learning algorithms and environments. | 410 |
pku-yuangroup/languagebind | Extending pretraining models to handle multiple modalities by aligning language and video representations | 751 |
jerry1993-tech/cornucopia-llama-fin-chinese | A Chinese finance-focused large language model fine-tuning framework | 596 |
brightmart/xlnet_zh | Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks | 230 |