jailbreak_llms
Prompt dataset
This dataset collects 15,140 prompts from various platforms to measure the vulnerability of large language models.
[CCS'24] A dataset consists of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts).
3k stars
36 watching
255 forks
Language: Jupyter Notebook
last commit: 3 months ago
Linked from 1 awesome list
chatgptjailbreaklarge-language-modelllmllm-securityprompt
Related projects:
Repository | Description | Stars |
---|---|---|
| A platform providing data, models, and evaluation benchmarks for large language models to promote accessibility and democratization of AI technology | 2,938 |
| A curated list of resources to help developers navigate the landscape of large language models and their applications in NLP | 9,551 |
| An open platform for training, serving, and evaluating large language models used in chatbots. | 37,269 |
| An experiment to explore and push the limits of ChatGPT's capabilities by using clever workarounds to bypass its restrictions. | 6,563 |
| A curated collection of high-quality datasets for training large language models. | 2,708 |
| A collection of GPT system prompts and various prompt injection/leaking knowledge to educate developers about writing effective system prompts and creating custom GPTs. | 8,375 |
| Provides a unified interface for fine-tuning large language models with parameter-efficient methods and instruction collection data | 2,640 |
| A tool for testing and evaluating large language models (LLMs) to ensure they are reliable and secure | 4,976 |
| Large-scale dialogue data and models for training chatbots and conversational AI systems | 2,276 |
| An open-source chatbot platform using large language models and vector databases | 2,707 |
| A framework for training and serving large language models using JAX/Flax | 2,428 |
| A platform that enables concurrent interaction with multiple AI chatbots to find the best answers. | 15,332 |
| Provides a unified framework to test generative language models on various evaluation tasks. | 7,200 |
| An ExpressJS middleware that allows users to execute LLM prompts stored in a git repository and retrieve results from a chosen model. | 74 |
| A toolkit for optimizing and serving large language models | 4,854 |