pythia
Knowledge analyzer
Analyzing knowledge development and evolution in large language models during training
The hub for EleutherAI's work on interpretability and learning dynamics
2k stars
33 watching
173 forks
Language: Jupyter Notebook
last commit: 20 days ago
Linked from 1 awesome list
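Pythia's distinguishing feature is that it publishes checkpoints at many intermediate training steps, which is what enables analyses of how knowledge develops during training. A minimal sketch of loading one such checkpoint through Hugging Face transformers (the 70m-deduped model size and the step3000 revision are chosen for illustration; other sizes and steps follow the same pattern):

```python
# Minimal sketch: load a Pythia checkpoint from an intermediate training step.
# Model size and step revision are illustrative; the suite provides several
# sizes and many per-step checkpoint revisions on the Hugging Face Hub.
from transformers import AutoTokenizer, GPTNeoXForCausalLM

model = GPTNeoXForCausalLM.from_pretrained(
    "EleutherAI/pythia-70m-deduped",
    revision="step3000",  # checkpoint taken at a specific training step
)
tokenizer = AutoTokenizer.from_pretrained(
    "EleutherAI/pythia-70m-deduped",
    revision="step3000",
)

# Quick sanity check: generate a short continuation with the partially trained model.
inputs = tokenizer("Hello, I am", return_tensors="pt")
tokens = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(tokens[0]))
```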
Related projects:
Repository | Description | Stars |
---|---|---|
eleutherai/gpt-neox | A framework for training large-scale language models on GPUs, built on Megatron-LM and DeepSpeed. | 6,941 |
huggingface/transformers | A collection of pre-trained machine learning models for natural language and computer vision tasks, enabling developers to fine-tune and deploy these models in their own projects. | 135,022 |
pytorch/examples | A collection of curated examples showcasing various PyTorch applications in computer vision, natural language processing, and reinforcement learning. | 22,428 |
pytorch/captum | Provides tools and algorithms to understand how machine learning models make predictions | 4,931 |
karpathy/mingpt | A minimal PyTorch implementation of a transformer-based language model | 20,175 |
timeseriesai/tsai | A comprehensive deep learning package for time series data analysis and forecasting. | 5,262 |
databrickslabs/dolly | An instruction-following large language model trained on the Databricks machine learning platform. | 10,820 |
lucidrains/reformer-pytorch | An implementation of Reformer, an efficient Transformer model for natural language processing tasks. | 2,120 |
codertimo/bert-pytorch | An implementation of Google's 2018 BERT model in PyTorch, allowing pre-training and fine-tuning for natural language processing tasks | 6,222 |
huggingface/pytorch-openai-transformer-lm | A PyTorch implementation of OpenAI's transformer language model with pre-trained weights and fine-tuning support. | 1,511 |
huggingface/trl | A library for training transformer language models with reinforcement learning, covering supervised fine-tuning, reward modeling, and PPO. | 10,053 |
google-research/electra | A method for pre-training transformer networks to learn language representations from text data without labeled supervision | 2,340 |
sktime/pytorch-forecasting | A PyTorch-based package for state-of-the-art time series forecasting with deep learning architectures | 4,001 |
kimiyoung/transformer-xl | An implementation of Transformer-XL, an attention-based architecture for language modeling beyond a fixed-length context. | 3,611 |
adapter-hub/adapters | A unified library for parameter-efficient and modular transfer learning in NLP tasks | 2,577 |