HarmBench
Attack simulator
A standardized framework for evaluating and improving the robustness of large language models against adversarial attacks
HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
366 stars
6 watching
59 forks
Language: Jupyter Notebook
last commit: 6 months ago
Linked from 1 awesome list
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | A standardized benchmark for measuring the robustness of machine learning models against adversarial attacks | 682 |
| | A tool designed to simulate system compromise or attack behaviors without running processes or PoCs. | 271 |
| | This repository provides a setup and framework for investigating irreversible backdoor attacks in Federated Learning systems. | 31 |
| | A collection of educational scripts and payloads for simulating vulnerabilities and malware attacks on Windows systems using custom hardware. | 60 |
| | A collaboration to create realistic test environments for simulating real-world attacks and improving detection strategies. | 704 |
| | An adversarial simulation tool to test information security preparedness by simulating network-based attacks on various systems. | 1,103 |
| | A toolset for creating and automating customized security events to simulate realistic scenarios for testing and training | 998 |
| | A toolbox for researching and evaluating robustness against attacks on machine learning models | 1,311 |
| | Utilities for simulating adversary behavior in the context of threat intelligence and security analysis | 1,011 |
| | A framework for attacking federated learning systems with adaptive backdoor attacks | 23 |
| | An intrusion detection system designed to capture and analyze SSH interactions between an attacker and a modified OpenSSH daemon | 26 |
| | A tool to simulate attacks against virtual environments and collect data into Splunk for detection development | 2,181 |
| | A tool designed to simulate malicious behavior against Google Workspace environments for threat research and detection rule effectiveness testing | 163 |
| | A tool for demonstrating and analyzing attacks on federated learning systems by introducing backdoors into distributed machine learning models. | 179 |
| | A comprehensive cyber adversary simulation platform for planning and conducting simulated attacks and exercises | 765 |