DecodingTrust
GPT model trustworthiness assessor
An assessment tool for evaluating trustworthiness in GPT models across various aspects such as toxicity, bias, robustness, and fairness.
A Comprehensive Assessment of Trustworthiness in GPT Models
267 stars
6 watching
57 forks
Language: Python
last commit: 4 months ago Related projects:
Repository | Description | Stars |
---|---|---|
howiehwong/trustllm | A toolkit for assessing trustworthiness in large language models | 491 |
geeks-of-data/knowledge-gpt | Extracts and stores information from various sources using AI models to generate answers. | 283 |
azure/pyrit | Empowers security professionals to identify risks in generative AI systems by providing a framework for proactive risk assessment and red teaming. | 1,977 |
trusted-ai/aix360 | A toolkit for explaining complex AI models and data-driven insights | 1,641 |
kvignesh122/assetnewssentimentanalyzer | An application providing sentiment analysis tools for financial assets and securities using GPT models and Google search results | 120 |
angelognazzo/reliable-trustworthy-ai | An implementation of a DeepPoly-based verifier for robustness analysis in deep neural networks | 2 |
decron/whitebox-code-gpt | A repository of GPT-powered programming assistants to support developers in their work. | 206 |
sturdy-dev/codereview.gpt | Reviews Pull/Merge Requests using AI-powered chatbot | 561 |
0xeb/gpt-analyst | A resource repository providing tools and guides for analyzing and reverse engineering GPT models. | 184 |
wireghoul/graudit | A tool to identify potential security flaws in source code using static analysis and regular expressions. | 1,548 |
mattzcarey/code-review-gpt | An automated code review tool powered by Large Language Models that scans source code for potential issues and provides feedback | 1,633 |
guanghelee/neurips19-certificates-of-robustness | Provides a framework for computing tight certificates of adversarial robustness for randomly smoothed classifiers. | 17 |
akamai-threat-research/mqtt-pwn | A tool for penetration testing and security assessment of MQTT brokers using various exploitation techniques. | 370 |
albertwy/gpt-4v-evaluation | An evaluation framework for GPT-4V models using data from An Early Evaluation of GPT-4V(ision) | 11 |
borealisai/advertorch | A toolbox for researching and evaluating robustness against attacks on machine learning models | 1,311 |