promptbench
A unified evaluation framework for large language models
2k stars
21 watching
179 forks
Language: Python
Last commit: 23 days ago
Linked from 1 awesome list
adversarial-attacks, benchmark, chatgpt, evaluation, large-language-models, prompt, prompt-engineering, robustness