transformer-xl
Language model backend
Implementations of a neural network architecture for language modeling
4k stars
84 watching
762 forks
Language: Python
last commit: over 2 years ago Related projects:
Repository | Description | Stars |
---|---|---|
| A collection of pre-trained machine learning models for various natural language and computer vision tasks, enabling developers to fine-tune and deploy these models on their own projects. | 136,357 |
| Provides tools and libraries for training and fine-tuning large language models using transformer architectures | 6,215 |
| An explanation of key concepts and advancements in the field of Machine Learning | 7,352 |
| An interactive visualization tool to help users understand how large language models like GPT work | 3,604 |
| Provides dense vector representations for text using transformer networks | 15,556 |
| A library designed to train transformer language models with reinforcement learning using various optimization techniques and fine-tuning methods. | 10,308 |
| A tool for training and sampling character-level language models using multi-layer recurrent neural networks | 2,643 |
| Implementing OpenAI's transformer language model in PyTorch with pre-trained weights and fine-tuning capabilities | 1,511 |
| A toolkit providing a library of easy-to-use ML modules and functionalities for composing various machine learning models and algorithms in TensorFlow. | 2,388 |
| A PyTorch model definition and inference/sampling code repository for a powerful diffusion transformer with fine-grained Chinese understanding | 3,678 |
| Real-time speech synthesis using state-of-the-art architectures | 3,855 |
| An implementation of the Transformer model in PyTorch, a deep learning framework for sequence-to-sequence tasks like language translation. | 8,936 |
| This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture. | 2,167 |
| A high-performance transformer-based NLP component optimized for GPU acceleration and integration into various frameworks. | 5,937 |
| Developing and pretraining a GPT-like Large Language Model from scratch | 35,405 |