transformer-explainer
Model explainer
An interactive visualization tool to help users understand how large language models like GPT work
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
3k stars
34 watching
290 forks
Language: JavaScript
last commit: about 1 month ago
Linked from 1 awesome list
deep-learninggenerative-aigptlangauge-modelllmvisualization
Related projects:
Repository | Description | Stars |
---|---|---|
google-research/vision_transformer | Provides pre-trained models and code for training vision transformers and mixers using JAX/Flax | 10,450 |
poloclub/cnn-explainer | An interactive visualization system to help non-experts learn about Convolutional Neural Networks (CNNs) by visualizing the learning process. | 8,109 |
kimiyoung/transformer-xl | Implementations of a neural network architecture for language modeling | 3,611 |
huggingface/transformers | A collection of pre-trained machine learning models for various natural language and computer vision tasks, enabling developers to fine-tune and deploy these models on their own projects. | 135,022 |
jbloomaus/decisiontransformerinterpretability | An open-source project that provides tools and utilities to understand how transformers are used in reinforcement learning tasks. | 73 |
google-research/text-to-text-transfer-transformer | Provides tools and libraries for training and fine-tuning large language models using transformer architectures | 6,170 |
openai/finetune-transformer-lm | This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture. | 2,160 |
dair-ai/ml-papers-explained | An explanation of key concepts and advancements in the field of Machine Learning | 7,315 |
matlab-deep-learning/transformer-models | An implementation of deep learning transformer models in MATLAB | 206 |
huggingface/tflite-android-transformers | Converts popular transformer models to run on Android devices for efficient inference and generation tasks. | 392 |
fastnlp/cpt | A pre-trained transformer model for natural language understanding and generation tasks in Chinese | 481 |
openai/transformer-debugger | An open-source tool that helps investigate specific behaviors of small language models by combining automated interpretability techniques with sparse autoencoders. | 4,035 |
zhuiyitechnology/gau-alpha | An implementation of a Gated Attention Unit-based Transformer model for natural language processing tasks | 96 |
huggingface/trl | A library designed to train transformer language models with reinforcement learning using various optimization techniques and fine-tuning methods. | 10,053 |
marella/ctransformers | Provides a unified interface to various transformer models implemented in C/C++ using GGML library | 1,814 |