CDial-GPT

Conversation Dataset

A large-scale Chinese conversation dataset and pre-trained dialog models for text generation

A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models

GitHub

2k stars

29 watching

252 forks

Language: Python

last commit: about 3 years ago

dialoguegptgpt-2lcccpytorchtext-generation

Related projects:

Repository	Description	Stars
thu-coai/opd	A large-scale pre-trained dialogue model for Chinese language	74
thu-coai/eva	Pre-trained chatbot models for Chinese open-domain dialogue systems	306
neukg/techgpt-2.0	An advanced language model designed to generate human-like responses in various domains and applications	101
clue-ai/chatyuan	Large language model for dialogue support in multiple languages	1,903
skyworkaigc/skytext-chinese-gpt3	An AI-powered text generation model trained on Chinese data to perform various tasks such as conversation, translation, and content creation.	418
radi-cho/datasetgpt	A command-line interface to generate textual datasets with Large Language Models	293
google-research-datasets/dstc8-schema-guided-dialogue	This dataset provides annotated conversations between humans and virtual assistants to train machine learning models for dialogue systems.	553
zcli-charlie/batgpt	A large language model designed to support long context conversations with improved efficiency and effectiveness	38
zake7749/gossiping-chinese-corpus	A collection of question-answer pairs extracted from online Chinese forums.	236
imcaspar/gpt2-ml	A collection of pre-trained GPT2 models and training scripts for multiple languages, including Chinese.	1,717
thu-coai/safety-prompts	Provides a dataset of safety prompts to evaluate and improve the safety of large language models.	880
aceimnorstuvwxz/dgk_lost_conv	A collection of preprocessed Chinese conversation corpora for use in natural language processing tasks.	1,089
ailab-cvc/gpt4tools	An intelligent system that enables automatic control and utilization of visual foundation models to interact with images in conversational settings.	762
2noise/chattts	A generative speech model designed to synthesize natural and expressive dialogue in interactive conversations.	32,941
hikariming/chat-dataset-baseline	Provides a resource library for training Chinese conversation models with pre-processed datasets and a framework for fine-tuning the models	1,162