PLMpapers

Language model research paper collection

Compiles and organizes key papers on pre-trained language models, providing a resource for developers and researchers.

Must-read Papers on pre-trained language models.

GitHub

3k stars
149 watching
436 forks
last commit: about 2 years ago
Linked from 2 awesome lists


Related projects:

Repository | Description | Stars
mooler0410/llmspracticalguide | A curated list of resources to help developers navigate the landscape of large language models and their applications in NLP | 9,489
thunlp/promptpapers | A curated list of papers on prompt-based tuning for pre-trained language models, providing insights and advancements in the field | 4,092
mlabonne/llm-course | A comprehensive course and resource package on building and deploying Large Language Models (LLMs) | 39,120
huawei-noah/pretrained-language-model | A collection of pre-trained language models and optimization techniques for efficient natural language processing | 3,028
balavenkatesh3322/nlp-pretrained-model | A collection of pre-trained natural language processing models | 170
brightmart/text_classification | An NLP project offering various text classification models and techniques for deep learning exploration | 7,861
thudm/glm | A general-purpose language model pre-trained with an autoregressive blank-filling objective and designed for various natural language understanding and generation tasks | 3,199
zjh-819/llmdatahub | A curated collection of high-quality datasets for training large language models | 2,635
cluebenchmark/cluepretrainedmodels | Provides pre-trained models for Chinese language tasks with improved performance and smaller model sizes compared to existing models | 804
brexhq/prompt-engineering | Guides software developers on how to effectively use and build systems around Large Language Models like GPT-4 | 8,440
dair-ai/ml-papers-explained | An explanation of key concepts and advancements in the field of Machine Learning | 7,315
thunlp/openclap | A repository of pre-trained language models for natural language processing tasks in Chinese | 979
instruction-tuning-with-gpt-4/gpt-4-llm | This project generates instruction-following data using GPT-4 to fine-tune large language models for real-world tasks | 4,210
young-geng/easylm | A framework for training and serving large language models using JAX/Flax | 2,409
rasbt/llms-from-scratch | Developing and pretraining a GPT-like Large Language Model from scratch | 32,908