dstc8-schema-guided-dialogue
Dialogue Dataset
This dataset provides annotated conversations between humans and virtual assistants to train machine learning models for dialogue systems.
The Schema-Guided Dialogue Dataset
553 stars
38 watching
125 forks
Language: Python
Last commit: over 1 year ago
Linked from 1 awesome list
Topics: assistant, dataset, dialogue, dialogue-systems, nlp-machine-learning
Related projects:
Repository | Description | Stars |
---|---|---|
ufal-dsg/alex_context_nlg_dataset | A dataset for training natural language generation models in dialogue systems by incorporating context information. | 23 |
thu-coai/cdial-gpt | A large-scale Chinese conversation dataset and pre-trained dialogue models for text generation. | 1,799 |
toyhom/chinese-medical-dialogue-data | A collection of medical dialogue data for training conversational AI models. | 1,264 |
ufal-dsg/tgen | A statistical natural language generator for spoken dialogue systems that uses two different algorithms to produce human-like responses. | 206 |
2noise/chattts | A generative speech model designed to synthesize natural and expressive dialogue in interactive conversations. | 32,941 |
xfffff/gekko-datasets | A collection of pre-computed trading data in SQLite format for backtesting and analysis. | 172 |
shawnwun/nndial | A toolkit for building end-to-end trainable task-oriented dialogue models. | 348 |
x-plug/chatplug | A Chinese open-domain dialogue system with features like knowledge augmentation, personalization, and multi-skill capabilities. | 316 |
osdg-ai/osdg-data | A dataset of human-labeled text excerpts validated against the Sustainable Development Goals. | 28 |
geomagical/geosynth | A synthetic dataset designed to improve performance in indoor scene perception tasks by providing detailed labels and photorealistic images. | 40 |
rdong08/spatialdwls_dataset | Code and data for a spatial decision-making tool. | 12 |
candlewill/dialog_corpus | A collection of datasets used to train and improve chatbot systems in both English and Chinese. | 2,033 |
radi-cho/datasetgpt | A command-line interface for generating textual datasets with large language models. | 293 |
korpling/salt | A flexible data model and API for representing linguistic data in a language-independent and theory-neutral way. | 15 |
google-research/cad-estate | A large dataset of 3D object and room layout annotations on RGB videos, designed to test automatic scene understanding methods. | 106 |