transformer-xl

Language model backend

Implementations of Transformer-XL, a neural network architecture for language modeling

4k stars
84 watching
762 forks
Language: Python
Last commit: about 2 years ago

Related projects:

| Repository | Description | Stars |
| --- | --- | --- |
| huggingface/transformers | A collection of pre-trained machine learning models for natural language and computer vision tasks that developers can fine-tune and deploy in their own projects | 135,022 |
| google-research/text-to-text-transfer-transformer | Tools and libraries for training and fine-tuning large language models built on transformer architectures | 6,170 |
| dair-ai/ml-papers-explained | Explanations of key concepts and advances in machine learning | 7,315 |
| poloclub/transformer-explainer | An interactive visualization tool that helps users understand how large language models like GPT work | 3,347 |
| ukplab/sentence-transformers | Dense vector representations for text using transformer networks | 15,329 |
| huggingface/trl | A library for training transformer language models with reinforcement learning, using various optimization techniques and fine-tuning methods | 10,053 |
| sherjilozair/char-rnn-tensorflow | A tool for training and sampling character-level language models using multi-layer recurrent neural networks | 2,643 |
| huggingface/pytorch-openai-transformer-lm | A PyTorch implementation of OpenAI's transformer language model with pre-trained weights and fine-tuning capabilities | 1,511 |
| asyml/texar | A toolkit of easy-to-use ML modules and functionalities for composing machine learning models and algorithms in TensorFlow | 2,389 |
| tencent/hunyuandit | A PyTorch-based diffusion transformer for text-to-image synthesis with fine-grained Chinese understanding | 3,456 |
| tensorspeech/tensorflowtts | Real-time speech synthesis using state-of-the-art architectures | 3,839 |
| jadore801120/attention-is-all-you-need-pytorch | A PyTorch implementation of the Transformer model for sequence-to-sequence tasks such as language translation | 8,868 |
| openai/finetune-transformer-lm | Code and model for improving language understanding through generative pre-training with a transformer-based architecture | 2,160 |
| nvidia/fastertransformer | A high-performance transformer-based NLP component optimized for GPU acceleration and integration into various frameworks | 5,886 |
| rasbt/llms-from-scratch | Developing and pretraining a GPT-like large language model from scratch | 32,908 |