PromptCBLUE

Medical NLP training data

PromptCBLUE: a large-scale Chinese instruction-tuning dataset for multi-task and few-shot learning in the medical domain

GitHub

323 stars
6 watching
33 forks
Language: Python
Last commit: 10 months ago

Related projects:

Repository | Description | Stars
michael-wzhu/chatmed | Develops and deploys large language models for Chinese medical consultations to improve answer accuracy | 518
cluebenchmark/pclue | A large-scale dataset for training models to perform multiple tasks and zero-shot learning in natural language processing | 468
michael-wzhu/shennong-tcm-llm | Develops and deploys a large language model for traditional Chinese medicine applications | 299
clue-ai/promptclue | A pre-trained language model for multiple natural language processing tasks with support for few-shot learning and transfer learning | 654
hanzhenlei767/nlp_learn | A collection of NLP-related code snippets and notes on various models and techniques, including pre-trained language models and Chinese text processing methods | 25
langboat/mengzi | Develops lightweight yet powerful pre-trained models for natural language processing tasks | 534
mne-tools/mne-python-notebooks | Interactive notebooks for EEG/MEG data analysis using Python | 26
openmedlab/pulse | A unified language service engine pre-trained on large amounts of medical domain data | 467
brightmart/xlnet_zh | Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks | 230
fuxiaoliu/mmc | Develops a large-scale dataset and benchmark for training multimodal chart understanding models using large language models | 84
ncbi-nlp/bluebert | Pre-trained language models for biomedical natural language processing tasks | 558
ymcui/cmrc2018 | A collection of data for evaluating Chinese machine reading comprehension systems | 415
alexa/massive | A collection of tools and modeling code for a large multilingual Natural Language Understanding dataset | 538
mikegu721/xiezhibenchmark | An evaluation suite to assess language models' performance on multiple-choice questions | 91
open-mmlab/mmengine | Provides a flexible and configurable framework for training deep learning models with PyTorch | 1,179