Multimodal-Robustness-Benchmark
Robustness tester
Evaluates the robustness of large language models to leading questions
43 stars
2 watching
0 forks
Language: Python
last commit: about 1 year ago Evaluates the robustness of large language models to leading questions