xlnet
Language model framework
A deep learning framework for unsupervised language representation learning and natural language processing tasks using a generalized autoregressive pretraining method.
XLNet: Generalized Autoregressive Pretraining for Language Understanding
6k stars
170 watching
1k forks
Language: Python
last commit: over 1 year ago
Tags: deep-learning, nlp, tensorflow
Related projects:
Repository | Description | Stars |
---|---|---|
brightmart/xlnet_zh | Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks | 230 |
ymcui/chinese-xlnet | Provides pre-trained models for Chinese natural language processing tasks using the XLNet architecture | 1,653 |
brightmart/text_classification | An NLP project offering various text classification models and techniques for deep learning exploration | 7,861 |
huawei-noah/pretrained-language-model | A collection of pre-trained language models and optimization techniques for efficient natural language processing | 3,028 |
aksnzhy/xlearn | A high-performance machine learning package with linear models and factorization machines. | 3,087 |
dair-ai/ml-papers-explained | An explanation of key concepts and advancements in the field of Machine Learning | 7,315 |
thunlp/plmpapers | Compiles and organizes key papers on pre-trained language models, providing a resource for developers and researchers. | 3,328 |
kimiyoung/transformer-xl | Implementations of Transformer-XL, a neural network architecture for language modeling beyond a fixed-length context | 3,611 |
openai/gpt-2 | A repository providing code and models for research into language modeling and multitask learning | 22,516 |
zjh-819/llmdatahub | A curated collection of high-quality datasets for training large language models. | 2,635 |
codertimo/bert-pytorch | An implementation of Google's 2018 BERT model in PyTorch, allowing pre-training and fine-tuning for natural language processing tasks | 6,222 |
microsoft/mpnet | Develops a pre-training method for language understanding that combines masked and permuted language modeling, and provides implementation and fine-tuning code. | 288 |
brexhq/prompt-engineering | Guides software developers on how to effectively use and build systems around Large Language Models like GPT-4. | 8,440 |
thudm/glm | A general-purpose language model pre-trained with an autoregressive blank-filling objective and designed for various natural language understanding and generation tasks. | 3,199 |
tensorflow/adanet | An automated machine learning framework that learns to ensemble sub-networks to improve model quality | 3,468 |