Gossiping-Chinese-Corpus
Forum dataset
A collection of question-answer pairs extracted from online Chinese forums.
Chinese question-answer corpus from the PTT Gossiping board (PTT 八卦版)
236 stars
13 watching
36 forks
Language: Jupyter Notebook
last commit: about 1 year ago
Linked from 1 awesome list
Tags: chatbot, chatbot-corpus, chinese-chatbot, chinese-corpus, chinese-dataset, chinese-nlp, corpus, dataset, dialog, ptt, question-answering
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | A collection of datasets used to train and improve chatbot systems in both English and Chinese. | 2,033 |
| | An insurance industry conversation corpus with pre-processed data for natural language processing and question answering tasks. | 1,019 |
| | Provides a resource library for training Chinese conversation models, with pre-processed datasets and a framework for fine-tuning the models. | 1,162 |
| | A conversational language model developed to improve understanding of complex instructions and Chinese vocabulary. | 62 |
| | A collection of datasets and tools for NLP tasks on Chinese texts, including part-of-speech tagging, named entity recognition, and question answering. | 529 |
| | A large-scale Chinese conversation dataset and pre-trained dialog models for text generation. | 1,799 |
| | Develops a large language model capable of handling complex medical conversations with high accuracy and professionalism. | 324 |
| | A large language model for dialogue support in multiple languages. | 1,903 |
| | A large-scale Chinese corpus for pre-training language models. | 927 |
| | Data collection and model development for a conversational AI chatbot focused on emotional wellness support in Korean. | 357 |
| | A collection of preprocessed Chinese conversation corpora for use in natural language processing tasks. | 1,089 |
| | Pre-trained chatbot models for Chinese open-domain dialogue systems. | 306 |
| | An updated version of a large language model designed to improve performance on multiple tasks and datasets. | 13 |
| | Develops large language models to support medical diagnoses and provide helpful suggestions. | 59 |
| | Provides pre-trained models for Chinese language tasks with improved performance and smaller model sizes compared to existing models. | 806 |