Gossiping-Chinese-Corpus
Forum dataset
A collection of question-answer pairs extracted from online Chinese forums.
PTT 八卦版問答中文語料
236 stars
13 watching
36 forks
Language: Jupyter Notebook
last commit: 4 months ago
Linked from 1 awesome list
chatbotchatbot-corpuschinese-chatbotchinese-corpuschinese-datasetchinese-nlpcorpusdatasetdialogpttquestion-answering
Related projects:
Repository | Description | Stars |
---|---|---|
| A collection of datasets used to train and improve chatbot systems in both English and Chinese. | 2,033 |
| An insurance industry conversation corpus with pre-processed data for natural language processing and question answering tasks. | 1,019 |
| Provides a resource library for training Chinese conversation models with pre-processed datasets and a framework for fine-tuning the models | 1,162 |
| A conversational language model developed to improve understanding of complex instructions and Chinese vocabulary. | 62 |
| A collection of datasets and tools for NLP tasks on Chinese texts, including part-of-speech tagging, named entity recognition, and question answering. | 529 |
| A large-scale Chinese conversation dataset and pre-trained dialog models for text generation | 1,799 |
| Develops a large language model capable of handling complex medical conversations with high accuracy and professionalism. | 324 |
| Large language model for dialogue support in multiple languages | 1,903 |
| A large-scale Chinese corpus for pre-training language models. | 927 |
| Data collection and model development for a conversational AI chatbot focused on emotional wellness support in Korean. | 357 |
| A collection of preprocessed Chinese conversation corpora for use in natural language processing tasks. | 1,089 |
| Pre-trained chatbot models for Chinese open-domain dialogue systems | 306 |
| An updated version of a large language model designed to improve performance on multiple tasks and datasets | 13 |
| Develops large language models to support medical diagnoses and provide helpful suggestions | 59 |
| Provides pre-trained models for Chinese language tasks with improved performance and smaller model sizes compared to existing models. | 806 |