text-to-text-transfer-transformer

Transformer library

Provides tools and libraries for training and fine-tuning large language models using transformer architectures

Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

GitHub

6k stars
108 watching
756 forks
Language: Python
last commit: 2 months ago

Related projects:

Repository Description Stars
google-research/t5x A modular framework for training and deploying sequence models at scale 2,682
huggingface/transformers A collection of pre-trained machine learning models for various natural language and computer vision tasks, enabling developers to fine-tune and deploy these models on their own projects. 135,022
kimiyoung/transformer-xl Implementations of a neural network architecture for language modeling 3,611
google-research/vision_transformer Provides pre-trained models and code for training vision transformers and mixers using JAX/Flax 10,450
poloclub/transformer-explainer An interactive visualization tool to help users understand how large language models like GPT work 3,347
google/trax An end-to-end deep learning library with clear code and speed 8,096
dair-ai/ml-papers-explained An explanation of key concepts and advancements in the field of Machine Learning 7,315
codertimo/bert-pytorch An implementation of Google's 2018 BERT model in PyTorch, allowing pre-training and fine-tuning for natural language processing tasks 6,222
nvidia/fastertransformer A high-performance transformer-based NLP component optimized for GPU acceleration and integration into various frameworks. 5,886
huggingface/tflite-android-transformers Converts popular transformer models to run on Android devices for efficient inference and generation tasks. 392
huggingface/trl A library designed to train transformer language models with reinforcement learning using various optimization techniques and fine-tuning methods. 10,053
ukplab/sentence-transformers Provides dense vector representations for text using transformer networks 15,329
openai/finetune-transformer-lm This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture. 2,160
microsoft/megatron-deepspeed Research tool for training large transformer language models at scale 1,895
huggingface/text-generation-inference A toolkit for deploying and serving Large Language Models. 9,106