Chinese-medical-dialogue-data

Medical dialogue dataset

A collection of medical dialogue data for training conversational AI models.

Chinese medical dialogue dataset (中文医疗对话数据集)

GitHub

1k stars
20 watching
251 forks
Language: Python
Last commit: over 1 year ago

Related projects:

| Repository | Description | Stars |
| --- | --- | --- |
| google-research-datasets/dstc8-schema-guided-dialogue | This dataset provides annotated conversations between humans and virtual assistants to train machine learning models for dialogue systems. | 553 |
| thu-coai/cdial-gpt | A large-scale Chinese conversation dataset and pre-trained dialog models for text generation | 1,799 |
| thu-coai/opd | A large-scale pre-trained dialogue model for Chinese language | 74 |
| suprityoung/zhongjing | Develops a large language model capable of handling complex medical conversations with high accuracy and professionalism. | 324 |
| michael-wzhu/chatmed | Develops and deploys large language models for Chinese medical consultations to improve answer accuracy | 531 |
| zake7749/gossiping-chinese-corpus | A collection of question-answer pairs extracted from online Chinese forums. | 236 |
| ufal-dsg/alex_context_nlg_dataset | A dataset for training natural language generation models in dialogue systems by incorporating context information. | 23 |
| x-d-lab/sunsimiao | A large-scale Chinese medical language model trained on diverse data sources to provide accurate and reliable medical information | 407 |
| xuefuzhao/instructionwild | Creating a large-scale user-based instruction dataset for natural language processing research and development | 455 |
| freedomintelligence/huatuo-26m | A large-scale medical question-and-answer dataset with over 26 million high-quality pairs, designed for natural language processing and machine learning applications in the medical field. | 226 |
| crownpku/small-chinese-corpus | A collection of datasets and tools for NLP tasks on Chinese texts, including part-of-speech tagging, named entity recognition, and question answering. | 529 |
| michael-wzhu/promptcblue | A large-scale instruction-tuning dataset for multi-task and few-shot learning in the medical domain | 328 |
| hypjudy/sparkles | Develops multimodal instruction-following models for open-ended dialogues across multiple images | 43 |
| clue-ai/chatyuan | Large language model for dialogue support in multiple languages | 1,903 |
| hikariming/chat-dataset-baseline | Provides a resource library for training Chinese conversation models with pre-processed datasets and a framework for fine-tuning the models | 1,162 |