Chinese-medical-dialogue-data

Medical dialogue dataset

A collection of medical dialogue data for training conversational AI models.

Chinese medical dialogue data (中文医疗对话数据集, "Chinese medical dialogue dataset")

1k stars

20 watching

251 forks

Language: Python

Last commit: almost 3 years ago

Related projects:

Repository	Description	Stars
google-research-datasets/dstc8-schema-guided-dialogue	This dataset provides annotated conversations between humans and virtual assistants to train machine learning models for dialogue systems.	553
thu-coai/cdial-gpt	A large-scale Chinese conversation dataset and pre-trained dialog models for text generation	1,799
thu-coai/opd	A large-scale pre-trained dialogue model for Chinese language	74
suprityoung/zhongjing	Develops a large language model capable of handling complex medical conversations with high accuracy and professionalism.	324
michael-wzhu/chatmed	Develops and deploys large language models for Chinese medical consultations to improve answer accuracy	531
zake7749/gossiping-chinese-corpus	A collection of question-answer pairs extracted from online Chinese forums.	236
ufal-dsg/alex_context_nlg_dataset	A dataset for training natural language generation models in dialogue systems by incorporating context information.	23
x-d-lab/sunsimiao	A large-scale Chinese medical language model trained on diverse data sources to provide accurate and reliable medical information	407
xuefuzhao/instructionwild	A large-scale user-based instruction dataset for natural language processing research and development	455
freedomintelligence/huatuo-26m	A large-scale medical question-and-answer dataset with over 26 million high-quality pairs, designed for natural language processing and machine learning applications in the medical field.	226
crownpku/small-chinese-corpus	A collection of datasets and tools for NLP tasks on Chinese texts, including part-of-speech tagging, named entity recognition, and question answering.	529
michael-wzhu/promptcblue	A large-scale instruction-tuning dataset for multi-task and few-shot learning in the medical domain	328
hypjudy/sparkles	Develops multimodal instruction-following models for open-ended dialogues across multiple images	43
clue-ai/chatyuan	Large language model for dialogue support in multiple languages	1,903
hikariming/chat-dataset-baseline	Provides a resource library for training Chinese conversation models with pre-processed datasets and a framework for fine-tuning the models	1,162