DecodingTrust
GPT model trustworthiness assessor
An assessment tool for evaluating trustworthiness in GPT models across various aspects such as toxicity, bias, robustness, and fairness.
A Comprehensive Assessment of Trustworthiness in GPT Models
259 stars
6 watching
56 forks
Language: Python
last commit: 2 months ago

Related projects:
Repository | Description | Stars |
---|---|---|
howiehwong/trustllm | A toolkit for assessing trustworthiness in large language models | 466 |
geeks-of-data/knowledge-gpt | Extracts and stores information from various sources using AI models to generate answers. | 279 |
azure/pyrit | Automates security risk identification and red teaming in generative AI systems | 1,891 |
trusted-ai/aix360 | A toolkit for explaining complex AI models and data-driven insights | 1,633 |
kvignesh122/assetnewssentimentanalyzer | An application providing sentiment analysis tools for financial assets and securities using GPT models and Google search results | 115 |
angelognazzo/reliable-trustworthy-ai | An implementation of a DeepPoly-based verifier for robustness analysis in deep neural networks | 1 |
decron/whitebox-code-gpt | A repository of GPT-powered programming assistants to support developers in their work. | 205 |
sturdy-dev/codereview.gpt | Reviews pull/merge requests using an AI-powered chatbot | 560 |
0xeb/gpt-analyst | A resource repository providing tools and guides for analyzing and reverse engineering GPT models. | 181 |
wireghoul/graudit | A tool to identify potential security flaws in source code using static analysis and regular expressions. | 1,538 |
mattzcarey/code-review-gpt | An automated code review tool powered by Large Language Models that scans source code for potential issues and provides feedback | 1,600 |
guanghelee/neurips19-certificates-of-robustness | Tight certificates of adversarial robustness for randomly smoothed classifiers | 17 |
akamai-threat-research/mqtt-pwn | A tool for penetration testing and security assessment of MQTT brokers using various exploitation techniques. | 367 |
albertwy/gpt-4v-evaluation | An evaluation framework for GPT-4V models using data from "An Early Evaluation of GPT-4V(ision)" | 11 |
borealisai/advertorch | A toolbox for researching and evaluating robustness against attacks on machine learning models | 1,308 |