transformer-xl
Language model backend
Implementations of a neural network architecture for language modeling
4k stars
84 watching
762 forks
Language: Python
last commit: about 2 years ago Related projects:
Repository | Description | Stars |
---|---|---|
huggingface/transformers | A collection of pre-trained machine learning models for various natural language and computer vision tasks, enabling developers to fine-tune and deploy these models on their own projects. | 135,022 |
google-research/text-to-text-transfer-transformer | Provides tools and libraries for training and fine-tuning large language models using transformer architectures | 6,170 |
dair-ai/ml-papers-explained | An explanation of key concepts and advancements in the field of Machine Learning | 7,315 |
poloclub/transformer-explainer | An interactive visualization tool to help users understand how large language models like GPT work | 3,347 |
ukplab/sentence-transformers | Provides dense vector representations for text using transformer networks | 15,329 |
huggingface/trl | A library designed to train transformer language models with reinforcement learning using various optimization techniques and fine-tuning methods. | 10,053 |
sherjilozair/char-rnn-tensorflow | A tool for training and sampling character-level language models using multi-layer recurrent neural networks | 2,643 |
huggingface/pytorch-openai-transformer-lm | Implementing OpenAI's transformer language model in PyTorch with pre-trained weights and fine-tuning capabilities | 1,511 |
asyml/texar | A toolkit providing a library of easy-to-use ML modules and functionalities for composing various machine learning models and algorithms in TensorFlow. | 2,389 |
tencent/hunyuandit | A PyTorch-based diffusion transformer model for generating images with fine-grained Chinese understanding and text-to-image synthesis | 3,456 |
tensorspeech/tensorflowtts | Real-time speech synthesis using state-of-the-art architectures | 3,839 |
jadore801120/attention-is-all-you-need-pytorch | An implementation of the Transformer model in PyTorch, a deep learning framework for sequence-to-sequence tasks like language translation. | 8,868 |
openai/finetune-transformer-lm | This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture. | 2,160 |
nvidia/fastertransformer | A high-performance transformer-based NLP component optimized for GPU acceleration and integration into various frameworks. | 5,886 |
rasbt/llms-from-scratch | Developing and pretraining a GPT-like Large Language Model from scratch | 32,908 |