xlnet

Language model framework

A deep learning framework for unsupervised language representation learning and natural language processing tasks using a generalized autoregressive pretraining method.

XLNet: Generalized Autoregressive Pretraining for Language Understanding

GitHub

6k stars
170 watching
1k forks
Language: Python
last commit: over 1 year ago
deep-learningnlptensorflow

Related projects:

Repository Description Stars
brightmart/xlnet_zh Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks 230
ymcui/chinese-xlnet Provides pre-trained models for Chinese natural language processing tasks using the XLNet architecture 1,653
brightmart/text_classification An NLP project offering various text classification models and techniques for deep learning exploration 7,861
huawei-noah/pretrained-language-model A collection of pre-trained language models and optimization techniques for efficient natural language processing 3,028
aksnzhy/xlearn A high-performance machine learning package with linear models and factorization machines. 3,087
dair-ai/ml-papers-explained An explanation of key concepts and advancements in the field of Machine Learning 7,315
thunlp/plmpapers Compiles and organizes key papers on pre-trained language models, providing a resource for developers and researchers. 3,328
kimiyoung/transformer-xl Implementations of a neural network architecture for language modeling 3,611
openai/gpt-2 A repository providing code and models for research into language modeling and multitask learning 22,516
zjh-819/llmdatahub A curated collection of high-quality datasets for training large language models. 2,635
codertimo/bert-pytorch An implementation of Google's 2018 BERT model in PyTorch, allowing pre-training and fine-tuning for natural language processing tasks 6,222
microsoft/mpnet Develops a method for pre-training language understanding models by combining masked and permuted techniques, and provides code for implementation and fine-tuning. 288
brexhq/prompt-engineering Guides software developers on how to effectively use and build systems around Large Language Models like GPT-4. 8,440
thudm/glm A general-purpose language model pre-trained with an autoregressive blank-filling objective and designed for various natural language understanding and generation tasks. 3,199
tensorflow/adanet An automated machine learning framework that learns to ensemble sub-networks to improve model quality 3,468