transformer-explainer

Model explainer

An interactive visualization tool to help users understand how large language models like GPT work

Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization

GitHub

3k stars
34 watching
290 forks
Language: JavaScript
last commit: about 1 month ago
Linked from 1 awesome list

deep-learning, generative-ai, gpt, langauge-model, llm, visualization

Related projects:

| Repository | Description | Stars |
|---|---|---|
| google-research/vision_transformer | Provides pre-trained models and code for training vision transformers and mixers using JAX/Flax | 10,450 |
| poloclub/cnn-explainer | An interactive visualization system to help non-experts learn about Convolutional Neural Networks (CNNs) by visualizing the learning process. | 8,109 |
| kimiyoung/transformer-xl | Implementations of a neural network architecture for language modeling | 3,611 |
| huggingface/transformers | A collection of pre-trained machine learning models for various natural language and computer vision tasks, enabling developers to fine-tune and deploy these models on their own projects. | 135,022 |
| jbloomaus/decisiontransformerinterpretability | An open-source project that provides tools and utilities to understand how transformers are used in reinforcement learning tasks. | 73 |
| google-research/text-to-text-transfer-transformer | Provides tools and libraries for training and fine-tuning large language models using transformer architectures | 6,170 |
| openai/finetune-transformer-lm | This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture. | 2,160 |
| dair-ai/ml-papers-explained | An explanation of key concepts and advancements in the field of Machine Learning | 7,315 |
| matlab-deep-learning/transformer-models | An implementation of deep learning transformer models in MATLAB | 206 |
| huggingface/tflite-android-transformers | Converts popular transformer models to run on Android devices for efficient inference and generation tasks. | 392 |
| fastnlp/cpt | A pre-trained transformer model for natural language understanding and generation tasks in Chinese | 481 |
| openai/transformer-debugger | An open-source tool that helps investigate specific behaviors of small language models by combining automated interpretability techniques with sparse autoencoders. | 4,035 |
| zhuiyitechnology/gau-alpha | An implementation of a Gated Attention Unit-based Transformer model for natural language processing tasks | 96 |
| huggingface/trl | A library designed to train transformer language models with reinforcement learning using various optimization techniques and fine-tuning methods. | 10,053 |
| marella/ctransformers | Provides a unified interface to various transformer models implemented in C/C++ using GGML library | 1,814 |