xlnet

Language model framework

A deep learning framework for unsupervised language representation learning and natural language processing tasks using a generalized autoregressive pretraining method.

XLNet: Generalized Autoregressive Pretraining for Language Understanding

GitHub

6k stars

170 watching

1k forks

Language: Python

last commit: about 2 years ago

deep-learningnlptensorflow

Related projects:

Repository	Description	Stars
brightmart/xlnet_zh	Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks	230
ymcui/chinese-xlnet	Provides pre-trained models for Chinese natural language processing tasks using the XLNet architecture	1,652
brightmart/text_classification	An NLP project offering various text classification models and techniques for deep learning exploration	7,881
huawei-noah/pretrained-language-model	A collection of pre-trained language models and optimization techniques for efficient natural language processing	3,039
aksnzhy/xlearn	A high-performance machine learning package with linear models and factorization machines.	3,087
dair-ai/ml-papers-explained	An explanation of key concepts and advancements in the field of Machine Learning	7,352
thunlp/plmpapers	Compiles and organizes key papers on pre-trained language models, providing a resource for developers and researchers.	3,331
kimiyoung/transformer-xl	Implementations of a neural network architecture for language modeling	3,619
openai/gpt-2	A repository providing code and models for research into language modeling and multitask learning	22,644
zjh-819/llmdatahub	A curated collection of high-quality datasets for training large language models.	2,708
codertimo/bert-pytorch	An implementation of Google's 2018 BERT model in PyTorch, allowing pre-training and fine-tuning for natural language processing tasks	6,251
microsoft/mpnet	Develops a method for pre-training language understanding models by combining masked and permuted techniques, and provides code for implementation and fine-tuning.	288
brexhq/prompt-engineering	Guides software developers on how to effectively use and build systems around Large Language Models like GPT-4.	8,487
thudm/glm	A general-purpose language model pre-trained with an autoregressive blank-filling objective and designed for various natural language understanding and generation tasks.	3,207
tensorflow/adanet	An automated machine learning framework that learns to ensemble sub-networks to improve model quality	3,470