DecisionTransformerInterpretability
Transformer Explainability Tool
An open-source project providing tools and utilities for interpreting how transformers simulate agents performing reinforcement learning tasks.
75 stars
4 watching
17 forks
Language: Jupyter Notebook
Last commit: over 1 year ago
Topics: mechanistic-interpretability, reinforcement-learning
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | An R package for providing explanations of predictions made by black box classifiers. | 486 |
| | An R package and workshop materials for explaining machine learning models using explainable AI techniques. | 52 |
| | A library for reverse engineering the algorithms learned by large language models from their weights. | 1,653 |
| | Provides a method to generate explanations for predictions made by any black box classifier. | 798 |
| | A comprehensive guide to using the Transformers library for natural language processing tasks. | 1,220 |
| | Provides a unified interface to various transformer models implemented in C/C++ using the GGML library. | 1,823 |
| | An implementation of transformer models in PyTorch for natural language processing tasks. | 1,257 |
| | An interpretability method that explains neural network predictions by highlighting high-level concepts relevant to classification tasks. | 633 |
| | An eXplainable AI toolkit for evaluating and interpreting neural network explanations across various deep learning frameworks. | 567 |
| | An implementation of deep neural network architectures, including Transformers, in Python. | 214 |
| | An implementation of deep learning transformer models in MATLAB. | 209 |
| | Converts popular transformer models to run on Android devices for efficient inference and generation tasks. | 396 |
| | A command-line tool for mass-replacing text patterns in files and renaming directories recursively. | 355 |
| | A command-line utility for transforming strings using various encoding and decoding algorithms. | 966 |
| | An implementation of Reformer, an efficient Transformer model for natural language processing tasks. | 2,132 |