pCLUE

NLP multi-task dataset

A large-scale dataset for training natural language processing models on multi-task and zero-shot learning.

pCLUE: 1,000,000+ multi-task prompt-learning dataset

GitHub

473 stars

7 watching

56 forks

Language: Jupyter Notebook

last commit: almost 4 years ago

Topics: chinese, clue, datasets, multi-task-learning, prompt-learning, promptclue, zero-shot-learning


www.cluebenchmarks.com/clueai.html

Related projects:

Repository	Description	Stars
clue-ai/promptclue	A pre-trained language model for multiple natural language processing tasks with support for few-shot learning and transfer learning	656
cluebenchmark/supercluelyb	A benchmarking platform for evaluating Chinese general-purpose models through anonymous, random battles	143
michael-wzhu/promptcblue	A large-scale instruction-tuning dataset for multi-task and few-shot learning in the medical domain	328
cluebenchmark/cluecorpus2020	A large-scale Chinese corpus for pre-training language models	927
cluebenchmark/cluepretrainedmodels	Pre-trained models for Chinese language tasks with improved performance and smaller model sizes compared to existing models	806
cluebenchmark/electra	Trains and evaluates a Chinese language model using adversarial training on a large corpus	140
clue-ai/chatyuan-7b	An updated version of a large language model designed to improve performance on multiple tasks and datasets	13
crownpku/small-chinese-corpus	A collection of datasets and tools for NLP tasks on Chinese texts, including part-of-speech tagging, named entity recognition, and question answering	529
karthikncode/nlp-datasets	A curated list of natural language processing datasets used to train and evaluate NLP models	919
pks/zipf	A Ruby NLP library providing tools and data structures for natural language processing tasks	3
clue-ai/chatyuan	A large language model for dialogue support in multiple languages	1,903
louismullie/stanford-core-nlp	Ruby bindings to Stanford Core NLP tools for natural language processing tasks	433
alexa/massive	A collection of tools and modeling code for a large multilingual natural language understanding dataset	541
uclnlp/jack	A framework for building machine reading comprehension models using natural language processing techniques	257