DecodingTrust

GPT model trustworthiness assessor

A tool for evaluating the trustworthiness of GPT models across dimensions such as toxicity, bias, robustness, and fairness.

A Comprehensive Assessment of Trustworthiness in GPT Models

GitHub: AI-secure/DecodingTrust

267 stars

6 watching

57 forks

Language: Python

Last commit: 11 months ago

decodingtrust.github.io/

Related projects:

Repository	Description	Stars
howiehwong/trustllm	A toolkit for assessing trustworthiness in large language models	491
geeks-of-data/knowledge-gpt	Extracts and stores information from various sources using AI models to generate answers.	283
azure/pyrit	Empowers security professionals to identify risks in generative AI systems by providing a framework for proactive risk assessment and red teaming.	1,977
trusted-ai/aix360	A toolkit for explaining complex AI models and data-driven insights	1,641
kvignesh122/assetnewssentimentanalyzer	An application providing sentiment analysis tools for financial assets and securities using GPT models and Google search results	120
angelognazzo/reliable-trustworthy-ai	An implementation of a DeepPoly-based verifier for robustness analysis in deep neural networks	2
decron/whitebox-code-gpt	A repository of GPT-powered programming assistants to support developers in their work.	206
sturdy-dev/codereview.gpt	Reviews pull/merge requests using an AI-powered chatbot	561
0xeb/gpt-analyst	A resource repository providing tools and guides for analyzing and reverse engineering GPT models.	184
wireghoul/graudit	A tool to identify potential security flaws in source code using static analysis and regular expressions.	1,548
mattzcarey/code-review-gpt	An automated code review tool powered by Large Language Models that scans source code for potential issues and provides feedback	1,633
guanghelee/neurips19-certificates-of-robustness	Provides a framework for computing tight certificates of adversarial robustness for randomly smoothed classifiers.	17
akamai-threat-research/mqtt-pwn	A tool for penetration testing and security assessment of MQTT brokers using various exploitation techniques.	370
albertwy/gpt-4v-evaluation	An evaluation framework for GPT-4V models using data from "An Early Evaluation of GPT-4V(ision)"	11
borealisai/advertorch	A toolbox for researching and evaluating robustness against attacks on machine learning models	1,311