DecisionTransformerInterpretability

Transformer Explainability Tool

An open-source project that provides tools and utilities to understand how transformers are used in reinforcement learning tasks.

Interpreting how transformers simulate agents performing RL tasks

GitHub

73 stars
4 watching
16 forks
Language: Jupyter Notebook
last commit: about 1 year ago
mechanistic-interpretabilityreinforcement-learning

Related projects:

Repository Description Stars
thomasp85/lime An R package for providing explanations of predictions made by black box classifiers. 485
pbiecek/xaiaterum2020 An R package and workshop materials for explaining machine learning models using explainable AI techniques 52
transformerlensorg/transformerlens A library for reverse engineering the algorithms learned by large language models from their weights 1,598
marcotcr/anchor Provides a method to generate explanations for predictions made by any black box classifier. 798
jsksxs360/how-to-use-transformers A comprehensive guide to using the Transformers library for natural language processing tasks 1,133
marella/ctransformers Provides a unified interface to various transformer models implemented in C/C++ using GGML library 1,814
tongjilibo/bert4torch An implementation of transformer models in PyTorch for natural language processing tasks 1,241
tensorflow/tcav An interpretability method that provides explanations for neural network predictions by highlighting high-level concepts relevant to classification tasks. 632
understandable-machine-intelligence-lab/quantus An eXplainable AI toolkit for evaluating and interpreting neural network explanations in various deep learning frameworks. 556
ibrahimsobh/transformers An implementation of deep neural network architectures, including Transformers, in Python. 212
matlab-deep-learning/transformer-models An implementation of deep learning transformer models in MATLAB 206
huggingface/tflite-android-transformers Converts popular transformer models to run on Android devices for efficient inference and generation tasks. 392
jlevy/repren A tool for refactoring and transforming text files according to regular expression patterns 347
abhimanyu003/sttr A command-line utility for transforming strings using various encoding and decoding algorithms. 958
lucidrains/reformer-pytorch An implementation of Reformer, an efficient Transformer model for natural language processing tasks. 2,120