DecodingTrust

GPT model trustworthiness assessor

An assessment tool for evaluating trustworthiness in GPT models across various aspects such as toxicity, bias, robustness, and fairness.

A Comprehensive Assessment of Trustworthiness in GPT Models

GitHub

267 stars
6 watching
57 forks
Language: Python
last commit: 4 months ago

Related projects:

Repository Description Stars
howiehwong/trustllm A toolkit for assessing trustworthiness in large language models 491
geeks-of-data/knowledge-gpt Extracts and stores information from various sources using AI models to generate answers. 283
azure/pyrit Empowers security professionals to identify risks in generative AI systems by providing a framework for proactive risk assessment and red teaming. 1,977
trusted-ai/aix360 A toolkit for explaining complex AI models and data-driven insights 1,641
kvignesh122/assetnewssentimentanalyzer An application providing sentiment analysis tools for financial assets and securities using GPT models and Google search results 120
angelognazzo/reliable-trustworthy-ai An implementation of a DeepPoly-based verifier for robustness analysis in deep neural networks 2
decron/whitebox-code-gpt A repository of GPT-powered programming assistants to support developers in their work. 206
sturdy-dev/codereview.gpt Reviews Pull/Merge Requests using AI-powered chatbot 561
0xeb/gpt-analyst A resource repository providing tools and guides for analyzing and reverse engineering GPT models. 184
wireghoul/graudit A tool to identify potential security flaws in source code using static analysis and regular expressions. 1,548
mattzcarey/code-review-gpt An automated code review tool powered by Large Language Models that scans source code for potential issues and provides feedback 1,633
guanghelee/neurips19-certificates-of-robustness Provides a framework for computing tight certificates of adversarial robustness for randomly smoothed classifiers. 17
akamai-threat-research/mqtt-pwn A tool for penetration testing and security assessment of MQTT brokers using various exploitation techniques. 370
albertwy/gpt-4v-evaluation An evaluation framework for GPT-4V models using data from An Early Evaluation of GPT-4V(ision) 11
borealisai/advertorch A toolbox for researching and evaluating robustness against attacks on machine learning models 1,311