pythia

Knowledge analyzer

Analyzing knowledge development and evolution in large language models during training

The hub for EleutherAI's work on interpretability and learning dynamics

GitHub

2k stars
33 watching
173 forks
Language: Jupyter Notebook
last commit: 20 days ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
eleutherai/gpt-neox Provides a framework for training large-scale language models on GPUs with advanced features and optimizations. 6,941
huggingface/transformers A collection of pre-trained machine learning models for various natural language and computer vision tasks, enabling developers to fine-tune and deploy these models on their own projects. 135,022
pytorch/examples A collection of curated examples showcasing various PyTorch applications in computer vision, natural language processing, and reinforcement learning. 22,428
pytorch/captum Provides tools and algorithms to understand how machine learning models make predictions 4,931
karpathy/mingpt A minimal PyTorch implementation of a transformer-based language model 20,175
timeseriesai/tsai A comprehensive deep learning package for time series data analysis and forecasting. 5,262
databrickslabs/dolly A large language model trained on a commercial machine learning platform with limited capabilities 10,820
lucidrains/reformer-pytorch An implementation of Reformer, an efficient Transformer model for natural language processing tasks. 2,120
codertimo/bert-pytorch An implementation of Google's 2018 BERT model in PyTorch, allowing pre-training and fine-tuning for natural language processing tasks 6,222
huggingface/pytorch-openai-transformer-lm Implementing OpenAI's transformer language model in PyTorch with pre-trained weights and fine-tuning capabilities 1,511
huggingface/trl A library designed to train transformer language models with reinforcement learning using various optimization techniques and fine-tuning methods. 10,053
google-research/electra A method for pre-training transformer networks to learn language representations from text data without labeled supervision 2,340
sktime/pytorch-forecasting A PyTorch-based package for state-of-the-art time series forecasting with deep learning architectures 4,001
kimiyoung/transformer-xl Implementations of a neural network architecture for language modeling 3,611
adapter-hub/adapters A unified library for parameter-efficient and modular transfer learning in NLP tasks 2,577