CDial-GPT
Conversation Dataset
A large-scale Chinese conversation dataset and pre-trained dialog models for text generation
A large-scale Chinese short-text conversation dataset (LCCC) and pre-trained Chinese dialog models
2k stars
29 watching
252 forks
Language: Python
Last commit: over 1 year ago
Tags: dialogue, gpt, gpt-2, lccc, pytorch, text-generation
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | A large-scale pre-trained dialogue model for Chinese language | 74 |
| | Pre-trained chatbot models for Chinese open-domain dialogue systems | 306 |
| | An advanced language model designed to generate human-like responses in various domains and applications | 101 |
| | Large language model for dialogue support in multiple languages | 1,903 |
| | An AI-powered text generation model trained on Chinese data to perform various tasks such as conversation, translation, and content creation. | 418 |
| | A command-line interface to generate textual datasets with Large Language Models | 293 |
| | This dataset provides annotated conversations between humans and virtual assistants to train machine learning models for dialogue systems. | 553 |
| | A large language model designed to support long context conversations with improved efficiency and effectiveness | 38 |
| | A collection of question-answer pairs extracted from online Chinese forums. | 236 |
| | A collection of pre-trained GPT2 models and training scripts for multiple languages, including Chinese. | 1,717 |
| | Provides a dataset of safety prompts to evaluate and improve the safety of large language models. | 880 |
| | A collection of preprocessed Chinese conversation corpora for use in natural language processing tasks. | 1,089 |
| | An intelligent system that enables automatic control and utilization of visual foundation models to interact with images in conversational settings. | 762 |
| | A generative speech model designed to synthesize natural and expressive dialogue in interactive conversations. | 32,941 |
| | Provides a resource library for training Chinese conversation models with pre-processed datasets and a framework for fine-tuning the models | 1,162 |