PromptInject
Prompt analysis tool
A framework for analyzing the robustness of large language models to adversarial prompt attacks
PromptInject is a framework that assembles prompts in a modular fashion to provide a quantitative analysis of the robustness of LLMs to adversarial prompt attacks. 🏆 Best Paper Awards @ NeurIPS ML Safety Workshop 2022
313 stars
11 watching
30 forks
Language: Python
last commit: 9 months ago adversarial-attacksagiagi-alignmentai-alignmentai-safetychain-of-thoughtgpt-3language-modelslarge-language-modelsmachine-learningml-safetyprompt-engineering
Related projects:
Repository | Description | Stars |
---|---|---|
protectai/rebuff | Protects AI applications from prompt injection attacks through multiple layers of defense | 1,124 |
jthack/pipe | A guide to help developers understand and mitigate the security risks of prompt injection in AI-powered applications and features. | 359 |
miesnerjacob/learn-prompting | A comprehensive resource for learning prompt engineering techniques for interacting with large language models. | 33 |
krrishdholakia/betterprompt | An API for evaluating the quality of text prompts used in Large Language Models (LLMs) based on perplexity estimation | 38 |
vaibkumr/prompt-optimizer | A tool to reduce the complexity of text prompts to minimize API costs and model computations. | 241 |
mitre/advmlthreatmatrix | A framework to help security analysts understand and prepare for adversarial machine learning attacks on AI systems | 1,050 |
microsoft/promptbench | A unified framework for evaluating large language models' performance and robustness in various scenarios. | 2,462 |
instadeepai/mava | A research-friendly codebase for experimenting with multi-agent reinforcement learning in JAX | 734 |
prompt-security/ps-fuzz | An interactive tool that tests and hardens the security of system prompts used in GenAI applications against various attacks. | 401 |
rafalzawadzki/spellbook-forge | An ExpressJS middleware that allows users to execute LLM prompts stored in a git repository and retrieve results from a chosen model. | 74 |
ga642381/speechprompt | An approach to leveraging pre-trained models for efficient speech processing tasks by using prompt tuning | 97 |
deadbits/vigil-llm | A security scanner for Large Language Model prompts to detect potential threats and vulnerabilities | 309 |
demisto/cops | Standardized framework for creating and sharing incident response processes in a shared language | 150 |
ncwilson78/system-prompt-library | A comprehensive collection of customizable prompts for Generative Pre-trained Transformers (GPTs) designed specifically for educational use. | 65 |
xcambar/purs | A Rust implementation of a minimal, fast, and aesthetically pleasing prompt system | 252 |