vllm-safety-benchmark
A safety test suite for vision language models
A benchmark for evaluating the safety and robustness of vision language models against adversarial attacks.
[ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"
72 stars
4 watching
3 forks
Language: Python
Last commit: about 1 year ago
Topics: adversarial-attacks, benchmark, datasets, llm, multimodal-llm, robustness, safety, vision-language-model
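As a rough illustration of the kind of evaluation such a benchmark performs, the sketch below queries a vision LLM with adversarial (image, prompt) pairs and reports how often the model refuses. This is a hypothetical, minimal harness, not the repository's actual API: `evaluate_safety`, `is_refusal`, the `model_fn` wrapper, and the refusal-keyword heuristic are all illustrative assumptions.

```python
from pathlib import Path
from typing import Callable, Iterable, Tuple

# Naive keyword heuristic for spotting a refusal in the model's reply
# (illustrative only; real benchmarks use more careful judging).
REFUSAL_MARKERS = ("i cannot", "i can't", "i'm sorry", "as an ai")


def is_refusal(response: str) -> bool:
    """Return True if the response looks like the model declined the request."""
    text = response.lower()
    return any(marker in text for marker in REFUSAL_MARKERS)


def evaluate_safety(
    model_fn: Callable[[Path, str], str],     # user-supplied wrapper: (image_path, prompt) -> response
    samples: Iterable[Tuple[Path, str]],      # adversarial image/prompt pairs
) -> float:
    """Query the model on every sample and return the overall refusal rate."""
    total = 0
    refused = 0
    for image_path, prompt in samples:
        response = model_fn(image_path, prompt)
        refused += is_refusal(response)
        total += 1
    return refused / max(total, 1)
```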
Related projects:
| Repository | Description | Stars |
| --- | --- | --- |
| | An open-source benchmarking framework for evaluating the cross-style visual capability of large multimodal models | 84 |
| | An implementation of a multimodal LLM training paradigm to enhance truthfulness and ethics in language models | 19 |
| | Improves the safety and helpfulness of large language models by fine-tuning them on safety-critical tasks | 47 |
| | A toolkit to detect and protect against vulnerabilities in large language models | 122 |
| | A set of tools and guidelines for assessing the security vulnerabilities of language models in AI applications | 28 |
| | A unified benchmark for safe reinforcement learning algorithms and environments | 410 |
| | A toolkit for assessing trustworthiness in large language models | 491 |
| | Evaluates and benchmarks the robustness of deep learning models to various corruptions and perturbations in computer vision tasks | 1,030 |
| | A large language model designed to process and generate visual information | 956 |
| | A PyTorch implementation of an enhanced vision language model | 93 |
| | A multimodal LLM designed to handle text-rich visual questions | 270 |
| | A PyTorch implementation of an encoder-free vision-language model that can be fine-tuned for various tasks and modalities | 246 |
| | A system that uses large language models to generate segmentation masks for images based on complex queries and world knowledge | 1,923 |
| | A PyTorch toolbox supporting research on domain adaptation, generalization, and semi-supervised learning in computer vision | 1,236 |
| | A benchmark for evaluating large language models' ability to process multimodal input | 322 |