pCLUE
NLP multi-task dataset
A large-scale dataset for training models to perform multiple tasks and zero-shot learning in natural language processing.
pCLUE: 1000000+多任务提示学习数据集
468 stars
7 watching
56 forks
Language: Jupyter Notebook
last commit: about 2 years ago chinesecluedatasetsmulti-task-learningprompt-learningpromptcluezero-shot-learning
Related projects:
Repository | Description | Stars |
---|---|---|
clue-ai/promptclue | A pre-trained language model for multiple natural language processing tasks with support for few-shot learning and transfer learning. | 654 |
cluebenchmark/supercluelyb | A benchmarking platform for evaluating Chinese general-purpose models through anonymous, random battles | 141 |
michael-wzhu/promptcblue | A large-scale instruction-tuning dataset for multi-task and few-shot learning in the medical domain | 323 |
cluebenchmark/cluecorpus2020 | A large-scale pre-training corpus for Chinese language models | 925 |
cluebenchmark/cluepretrainedmodels | Provides pre-trained models for Chinese language tasks with improved performance and smaller model sizes compared to existing models. | 804 |
cluebenchmark/electra | Trains and evaluates a Chinese language model using adversarial training on a large corpus. | 140 |
clue-ai/chatyuan-7b | An updated version of a large language model designed to improve performance on multiple tasks and datasets | 13 |
crownpku/small-chinese-corpus | A collection of datasets and tools for NLP tasks on Chinese texts, including part-of-speech tagging, named entity recognition, and question answering. | 531 |
karthikncode/nlp-datasets | A curated list of Natural Language Processing datasets used to train and evaluate NLP models. | 919 |
pks/zipf | A Ruby NLP library providing tools and data structures for natural language processing tasks | 3 |
hanzhenlei767/nlp_learn | A comprehensive collection of NLP-related code snippets and notes on various models and techniques, including pre-trained language models and Chinese text processing methods. | 25 |
clue-ai/chatyuan | Large language model for dialogue support in multiple languages | 1,902 |
louismullie/stanford-core-nlp | Provides Ruby bindings to Stanford Core NLP tools for natural language processing tasks | 432 |
alexa/massive | A collection of tools and modeling code for a large multilingual Natural Language Understanding dataset | 538 |
uclnlp/jack | A framework for building machine reading comprehension models using natural language processing techniques | 257 |