TransformerLens
Model decipherer
A library for reverse engineering the algorithms learned by large language models from their weights
A library for mechanistic interpretability of GPT-style language models
2k stars
16 watching
315 forks
Language: Python
last commit: 2 months ago Related projects:
Repository | Description | Stars |
---|---|---|
| An implementation of deep learning transformer models in MATLAB | 209 |
| An implementation of Reformer, an efficient Transformer model for natural language processing tasks. | 2,132 |
| A command-line tool for mass-replacing text patterns in files and renaming directories recursively. | 355 |
| This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture. | 2,167 |
| Provides a collection of reusable data transformation tools | 10 |
| Implementation of a transformer-based translation model in PyTorch | 240 |
| A Python library with multiple transformers to engineer and select features for use in machine learning models. | 1,956 |
| A pre-trained transformer model for natural language understanding and generation tasks in Chinese | 482 |
| Provides a unified interface to various transformer models implemented in C/C++ using GGML library | 1,823 |
| A Python library to manipulate and transform indexable data | 49 |
| A library for serializing and deserializing data structures into strings, mappings, and lists while performing validation. | 451 |
| An open-source project that provides tools and utilities to understand how transformers are used in reinforcement learning tasks. | 75 |
| Research tool for training large transformer language models at scale | 1,926 |
| A collection of tools and scripts for training large transformer language models at scale | 1,342 |
| A generative transformer model designed to process and generate text in various vertical domains, including computer science, finance, and more. | 217 |