awesome-prompt-injection
Vulnerability assessment
Resources and information on prompt injection, a type of vulnerability that specifically targets machine learning models
183 stars
6 watching
32 forks
last commit: 5 months ago
Linked from 1 awesome list
awesome, awesome-list
Awesome Prompt Injection / Articles and Blog posts | |||
Prompt injection: What's the worst that can happen? | General overview of Prompt Injection attacks, part of a series | ||
ChatGPT Plugins: Data Exfiltration via Images & Cross Plugin Request Forgery | This post shows how a malicious website can take control of a ChatGPT chat session and exfiltrate the history of the conversation | ||
Data exfiltration via Indirect Prompt Injection in ChatGPT | This post explores two prompt injections in OpenAI's browsing plugin for ChatGPT. These techniques exploit the input-dependent nature of AI conversational models, allowing an attacker to exfiltrate data through several prompt injection methods, posing significant privacy and security risks | ||
Prompt Injection Cheat Sheet: How To Manipulate AI Language Models | A prompt injection cheat sheet for AI bot integrations | ||
Prompt injection explained | Video, slides, and a transcript of an introduction to prompt injection and why it's important | ||
Adversarial Prompting | A guide on the various types of adversarial prompting and ways to mitigate them | ||
Don't you (forget NLP): Prompt injection with control characters in ChatGPT | A look from Dropbox at how to achieve prompt injection using control characters | ||
Testing the Limits of Prompt Injection Defence | A practical discussion about the unique complexities of securing LLMs from prompt injection attacks | ||
Awesome Prompt Injection / Tutorials | |||
Prompt Injection | Prompt Injection tutorial from Learn Prompting | ||
AI Red Teaming from Google | Google's red team walkthrough of hacking AI systems | ||
Awesome Prompt Injection / Tools | |||
Token Turbulenz | 13 | over 1 year ago | A fuzzer to automate looking for possible Prompt Injections |
Garak | 1,471 | 8 days ago | Automates the search for hallucination, data leakage, prompt injection, misinformation, toxicity generation, jailbreaks, and many other weaknesses in LLMs |
Awesome Prompt Injection / CTF | |||
Promptalanche | As well as traditional challenges, this CTF also introduces scenarios that mimic agents in real-world applications | ||
Gandalf | Your goal is to make Gandalf reveal the secret password for each level. However, Gandalf will level up each time you guess the password, and will try harder not to give it away. Can you beat level 7? (There is a bonus level 8) | ||
ChatGPT with Browsing is drunk! There is more to it than you might expect at first glance | This riddle requires you to have ChatGPT Plus access and enable the Browsing mode in Settings->Beta Features | ||
Awesome Prompt Injection / Community | |||
Learn Prompting | Discord server from Learn Prompting |