Multimodal-Robustness-Benchmark
Robustness tester
Evaluates the robustness of large language models to leading questions
43 stars
2 watching
0 forks
Language: Python
last commit: 4 months ago Evaluates the robustness of large language models to leading questions