PLMpapers
Language model research paper collection
Compiles and organizes key papers on pre-trained language models, providing a resource for developers and researchers.
Must-read Papers on pre-trained language models.
3k stars
149 watching
436 forks
last commit: about 2 years ago
Linked from 2 awesome lists
Related projects:
| Repository | Description | Stars |
|---|---|---|
| mooler0410/llmspracticalguide | A curated list of resources to help developers navigate the landscape of large language models and their applications in NLP | 9,489 |
| thunlp/promptpapers | A curated list of papers on prompt-based tuning for pre-trained language models, providing insights and advancements in the field. | 4,092 |
| mlabonne/llm-course | A comprehensive course and resource package on building and deploying Large Language Models (LLMs) | 39,120 |
| huawei-noah/pretrained-language-model | A collection of pre-trained language models and optimization techniques for efficient natural language processing | 3,028 |
| balavenkatesh3322/nlp-pretrained-model | A collection of pre-trained natural language processing models | 170 |
| brightmart/text_classification | An NLP project offering various text classification models and techniques for deep learning exploration | 7,861 |
| thudm/glm | A general-purpose language model pre-trained with an autoregressive blank-filling objective and designed for various natural language understanding and generation tasks. | 3,199 |
| zjh-819/llmdatahub | A curated collection of high-quality datasets for training large language models. | 2,635 |
| cluebenchmark/cluepretrainedmodels | Provides pre-trained models for Chinese language tasks with improved performance and smaller model sizes compared to existing models. | 804 |
| brexhq/prompt-engineering | Guides software developers on how to effectively use and build systems around Large Language Models like GPT-4. | 8,440 |
| dair-ai/ml-papers-explained | An explanation of key concepts and advancements in the field of Machine Learning | 7,315 |
| thunlp/openclap | A repository of pre-trained language models for natural language processing tasks in Chinese | 979 |
| instruction-tuning-with-gpt-4/gpt-4-llm | This project generates instruction-following data using GPT-4 to fine-tune large language models for real-world tasks. | 4,210 |
| young-geng/easylm | A framework for training and serving large language models using JAX/Flax | 2,409 |
| rasbt/llms-from-scratch | Developing and pretraining a GPT-like Large Language Model from scratch | 32,908 |