FasterTransformer

Transformer component

A high-performance transformer-based NLP component optimized for GPU acceleration and integration into various frameworks.

Transformer related optimization, including BERT, GPT

GitHub

6k stars
62 watching
895 forks
Language: C++
last commit: 10 months ago
Linked from 1 awesome list

bertgptpytorchtransformer

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
opennmt/ctranslate2 A high-performance inference engine for transformer models 3,467
nvidia/tensorrt Provides a set of tools and libraries for optimizing deep learning inference on NVIDIA GPUs. 10,926
google-research/text-to-text-transfer-transformer Provides tools and libraries for training and fine-tuning large language models using transformer architectures 6,215
kimiyoung/transformer-xl Implementations of a neural network architecture for language modeling 3,619
nvidia/megatron-lm A framework for training large language models using scalable and optimized GPU techniques 10,804
codertimo/bert-pytorch An implementation of Google's 2018 BERT model in PyTorch, allowing pre-training and fine-tuning for natural language processing tasks 6,251
nvidia/minkowskiengine An auto-differentiation library for sparse tensors used in computer vision and deep learning applications. 2,513
facebookresearch/metaseq A codebase for working with Open Pre-trained Transformers, enabling deployment and fine-tuning of transformer models on various platforms. 6,519
eleutherai/gpt-neox Provides a framework for training large-scale language models on GPUs with advanced features and optimizations. 6,997
tensorspeech/tensorflowtts Real-time speech synthesis using state-of-the-art architectures 3,855
google-research/bert Provides pre-trained models and code for natural language processing tasks using TensorFlow 38,374
google-research/vision_transformer Provides pre-trained models and code for training vision transformers and mixers using JAX/Flax 10,620
luolc/adabound An optimizer that combines the benefits of Adam and SGD algorithms 2,908
tensorpack/tensorpack A high-performance neural network training interface for TensorFlow that focuses on speed and flexibility. 6,303
brightmart/roberta_zh Implements RoBERTa for Chinese pre-training using TensorFlow and provides PyTorch versions for loading and training 2,638