dstc8-schema-guided-dialogue

Dialogue dataset

A collection of datasets and tools for developing virtual assistants that can understand and respond to human conversations

The Schema-Guided Dialogue Dataset

GitHub

548 stars
38 watching
124 forks
Language: Python
last commit: over 1 year ago
Linked from 1 awesome list

assistantdatasetdialoguedialogue-systemsnlp-machine-learning

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
ufal-dsg/alex_context_nlg_dataset A dataset for training natural language generation models in dialogue systems by incorporating context information. 23
thu-coai/cdial-gpt A large-scale Chinese conversation dataset and pre-trained dialog models for text generation 1,782
toyhom/chinese-medical-dialogue-data A collection of medical dialogue data for training conversational AI models. 1,227
ufal-dsg/tgen A statistical natural language generator for spoken dialogue systems using two different algorithms to produce human-like responses 204
2noise/chattts A generative speech model designed to synthesize natural and expressive dialogue in interactive conversations. 32,347
xfffff/gekko-datasets A collection of pre-computed trading data in SQLite format for backtesting and analysis. 170
shawnwun/nndial A toolkit for building end-to-end trainable task-oriented dialogue models. 348
x-plug/chatplug A Chinese open-domain dialogue system with features like knowledge augmentation, personalization, and multi-skill capabilities. 314
osdg-ai/osdg-data A dataset of human-labeled text excerpts validated against the Sustainable Development Goals. 28
geomagical/geosynth A synthetic dataset designed to improve performance in indoor scene perception tasks by providing detailed labels and photorealistic images. 40
rdong08/spatialdwls_dataset Provides code and data for a spatial decision-making tool 12
candlewill/dialog_corpus A collection of datasets used to train and improve chatbot systems in both English and Chinese. 2,033
radi-cho/datasetgpt A command-line interface to generate textual datasets with Large Language Models 293
korpling/salt A flexible data model and API for representing linguistic data in a language-independent and theory-neutral way. 15
google-research/cad-estate A large dataset of 3D object and room layout annotations on RGB videos, designed to test automatic scene understanding methods. 105