DecodingTrust

GPT model trustworthiness assessor

An assessment tool for evaluating the trustworthiness of GPT models across dimensions such as toxicity, bias, robustness, and fairness.

A Comprehensive Assessment of Trustworthiness in GPT Models

GitHub

259 stars
6 watching
56 forks
Language: Python
Last commit: 2 months ago

Related projects:

Repository | Description | Stars
howiehwong/trustllm | A toolkit for assessing trustworthiness in large language models | 466
geeks-of-data/knowledge-gpt | Extracts and stores information from various sources using AI models to generate answers | 279
azure/pyrit | Automates security risk identification and red teaming in generative AI systems | 1,891
trusted-ai/aix360 | A toolkit for explaining complex AI models and data-driven insights | 1,633
kvignesh122/assetnewssentimentanalyzer | An application providing sentiment analysis tools for financial assets and securities using GPT models and Google search results | 115
angelognazzo/reliable-trustworthy-ai | An implementation of a DeepPoly-based verifier for robustness analysis in deep neural networks | 1
decron/whitebox-code-gpt | A repository of GPT-powered programming assistants to support developers in their work | 205
sturdy-dev/codereview.gpt | Reviews pull/merge requests using an AI-powered chatbot | 560
0xeb/gpt-analyst | A resource repository providing tools and guides for analyzing and reverse engineering GPT models | 181
wireghoul/graudit | A tool to identify potential security flaws in source code using static analysis and regular expressions | 1,538
mattzcarey/code-review-gpt | An automated code review tool powered by large language models that scans source code for potential issues and provides feedback | 1,600
guanghelee/neurips19-certificates-of-robustness | Tight certificates of adversarial robustness for randomly smoothed classifiers | 17
akamai-threat-research/mqtt-pwn | A tool for penetration testing and security assessment of MQTT brokers using various exploitation techniques | 367
albertwy/gpt-4v-evaluation | An evaluation framework for GPT-4V models using data from "An Early Evaluation of GPT-4V(ision)" | 11
borealisai/advertorch | A toolbox for researching and evaluating robustness against attacks on machine learning models | 1,308