text-to-text-transfer-transformer
Transformer library
Provides tools and libraries for training and fine-tuning large language models using transformer architectures
Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
6k stars
108 watching
756 forks
Language: Python
last commit: 2 months ago Related projects:
Repository | Description | Stars |
---|---|---|
google-research/t5x | A modular framework for training and deploying sequence models at scale | 2,682 |
huggingface/transformers | A collection of pre-trained machine learning models for various natural language and computer vision tasks, enabling developers to fine-tune and deploy these models on their own projects. | 135,022 |
kimiyoung/transformer-xl | Implementations of a neural network architecture for language modeling | 3,611 |
google-research/vision_transformer | Provides pre-trained models and code for training vision transformers and mixers using JAX/Flax | 10,450 |
poloclub/transformer-explainer | An interactive visualization tool to help users understand how large language models like GPT work | 3,347 |
google/trax | An end-to-end deep learning library with clear code and speed | 8,096 |
dair-ai/ml-papers-explained | An explanation of key concepts and advancements in the field of Machine Learning | 7,315 |
codertimo/bert-pytorch | An implementation of Google's 2018 BERT model in PyTorch, allowing pre-training and fine-tuning for natural language processing tasks | 6,222 |
nvidia/fastertransformer | A high-performance transformer-based NLP component optimized for GPU acceleration and integration into various frameworks. | 5,886 |
huggingface/tflite-android-transformers | Converts popular transformer models to run on Android devices for efficient inference and generation tasks. | 392 |
huggingface/trl | A library designed to train transformer language models with reinforcement learning using various optimization techniques and fine-tuning methods. | 10,053 |
ukplab/sentence-transformers | Provides dense vector representations for text using transformer networks | 15,329 |
openai/finetune-transformer-lm | This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture. | 2,160 |
microsoft/megatron-deepspeed | Research tool for training large transformer language models at scale | 1,895 |
huggingface/text-generation-inference | A toolkit for deploying and serving Large Language Models. | 9,106 |