FasterTransformer

Transformer component

A high-performance transformer-based NLP component optimized for GPU acceleration and integration into various frameworks.

Transformer related optimization, including BERT, GPT

GitHub

6k stars

62 watching

895 forks

Language: C++

last commit: over 1 year ago

Linked from 1 awesome list

bertgptpytorchtransformer

Backlinks from these awesome lists:

hannibal046/awesome-llm

Related projects:

Repository	Description	Stars
opennmt/ctranslate2	A high-performance inference engine for transformer models	3,467
nvidia/tensorrt	Provides a set of tools and libraries for optimizing deep learning inference on NVIDIA GPUs.	10,926
google-research/text-to-text-transfer-transformer	Provides tools and libraries for training and fine-tuning large language models using transformer architectures	6,215
kimiyoung/transformer-xl	Implementations of a neural network architecture for language modeling	3,619
nvidia/megatron-lm	A framework for training large language models using scalable and optimized GPU techniques	10,804
codertimo/bert-pytorch	An implementation of Google's 2018 BERT model in PyTorch, allowing pre-training and fine-tuning for natural language processing tasks	6,251
nvidia/minkowskiengine	An auto-differentiation library for sparse tensors used in computer vision and deep learning applications.	2,513
facebookresearch/metaseq	A codebase for working with Open Pre-trained Transformers, enabling deployment and fine-tuning of transformer models on various platforms.	6,519
eleutherai/gpt-neox	Provides a framework for training large-scale language models on GPUs with advanced features and optimizations.	6,997
tensorspeech/tensorflowtts	Real-time speech synthesis using state-of-the-art architectures	3,855
google-research/bert	Provides pre-trained models and code for natural language processing tasks using TensorFlow	38,374
google-research/vision_transformer	Provides pre-trained models and code for training vision transformers and mixers using JAX/Flax	10,620
luolc/adabound	An optimizer that combines the benefits of Adam and SGD algorithms	2,908
tensorpack/tensorpack	A high-performance neural network training interface for TensorFlow that focuses on speed and flexibility.	6,303
brightmart/roberta_zh	Implements RoBERTa for Chinese pre-training using TensorFlow and provides PyTorch versions for loading and training	2,638