PromptCBLUE

Medical NLP training data

A large-scale instruction-tuning dataset for multi-task and few-shot learning in the medical domain

PromptCBLUE: a large-scale instruction-tuning dataset for multi-task and few-shot learning in the medical domain in Chinese

GitHub

328 stars
6 watching
34 forks
Language: Python
last commit: about 1 year ago

Related projects:

Repository Description Stars
michael-wzhu/chatmed Develops and deploys large language models for Chinese medical consultations to improve answer accuracy 531
cluebenchmark/pclue A large-scale dataset for training models to perform multiple tasks and zero-shot learning in natural language processing. 473
michael-wzhu/shennong-tcm-llm Develops and deploys a large language model for Chinese traditional medicine applications 316
clue-ai/promptclue A pre-trained language model for multiple natural language processing tasks with support for few-shot learning and transfer learning. 656
langboat/mengzi Develops lightweight yet powerful pre-trained models for natural language processing tasks 533
mne-tools/mne-python-notebooks Interactive notebooks for EEG/MEG data analysis using Python 26
openmedlab/pulse A unified language service engine pre-trained on large amounts of medical domain data 471
brightmart/xlnet_zh Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks 230
fuxiaoliu/mmc Develops a large-scale dataset and benchmark for training multimodal chart understanding models using large language models. 87
ncbi-nlp/bluebert Pre-trained language models for biomedical natural language processing tasks 560
ymcui/cmrc2018 A collection of data for evaluating Chinese machine reading comprehension systems 419
alexa/massive A collection of tools and modeling code for a large multilingual Natural Language Understanding dataset 541
mikegu721/xiezhibenchmark An evaluation suite to assess language models' performance in multi-choice questions 93
open-mmlab/mmengine Provides a flexible and configurable framework for training deep learning models with PyTorch. 1,196