Chinese-medical-dialogue-data
Medical dialogue dataset
A collection of medical dialogue data for training conversational AI models.
Chinese medical dialogue data 中文医疗对话数据集
1k stars
20 watching
251 forks
Language: Python
last commit: over 1 year ago Related projects:
Repository | Description | Stars |
---|---|---|
google-research-datasets/dstc8-schema-guided-dialogue | This dataset provides annotated conversations between humans and virtual assistants to train machine learning models for dialogue systems. | 553 |
thu-coai/cdial-gpt | A large-scale Chinese conversation dataset and pre-trained dialog models for text generation | 1,799 |
thu-coai/opd | A large-scale pre-trained dialogue model for Chinese language | 74 |
suprityoung/zhongjing | Develops a large language model capable of handling complex medical conversations with high accuracy and professionalism. | 324 |
michael-wzhu/chatmed | Develops and deploys large language models for Chinese medical consultations to improve answer accuracy | 531 |
zake7749/gossiping-chinese-corpus | A collection of question-answer pairs extracted from online Chinese forums. | 236 |
ufal-dsg/alex_context_nlg_dataset | A dataset for training natural language generation models in dialogue systems by incorporating context information. | 23 |
x-d-lab/sunsimiao | A large-scale Chinese medical language model trained on diverse data sources to provide accurate and reliable medical information | 407 |
xuefuzhao/instructionwild | Creating a large-scale user-based instruction dataset for natural language processing research and development | 455 |
freedomintelligence/huatuo-26m | A large-scale medical question-and-answer dataset with over 26 million high-quality pairs, designed for natural language processing and machine learning applications in the medical field. | 226 |
crownpku/small-chinese-corpus | A collection of datasets and tools for NLP tasks on Chinese texts, including part-of-speech tagging, named entity recognition, and question answering. | 529 |
michael-wzhu/promptcblue | A large-scale instruction-tuning dataset for multi-task and few-shot learning in the medical domain | 328 |
hypjudy/sparkles | Develops multimodal instruction-following models for open-ended dialogues across multiple images | 43 |
clue-ai/chatyuan | Large language model for dialogue support in multiple languages | 1,903 |
hikariming/chat-dataset-baseline | Provides a resource library for training Chinese conversation models with pre-processed datasets and a framework for fine-tuning the models | 1,162 |