alex_context_nlg_dataset

Dialogue Context Dataset

A dataset for training natural language generation (NLG) models for dialogue systems that take preceding dialogue context into account.

An NLG dataset in which each generation instance is accompanied by its preceding context.

GitHub

23 stars
4 watching
13 forks
last commit: about 8 years ago
Linked from 1 awesome list


Related projects:

Repository | Description | Stars
ufal-dsg/tgen | A statistical natural language generator for spoken dialogue systems using two different algorithms to produce human-like responses | 205
google-research-datasets/dstc8-schema-guided-dialogue | A large-scale dialogue dataset designed to support the development of virtual assistants and dialogue systems, with a focus on task-oriented conversations and linguistic variation. | 549
shawnwun/rnnlg | A toolkit for benchmarking Natural Language Generation in Spoken Dialogue Systems | 491
bwilcox-1234/chatscript | A natural language processing tool for creating conversational dialogue systems | 17
tjunlp-lab/shallow-discourse-annotation-for-chinese-ted-talks | Provides annotated data and tools for annotating Chinese TED Talks with discourse-level properties. | 8
nytud/hulu | A collection of linguistic datasets and benchmarks for natural language understanding tasks | 9
candlewill/dialog_corpus | A collection of datasets used to train and improve chatbot systems in both English and Chinese. | 2,033
thu-coai/cdial-gpt | A large-scale Chinese conversation dataset and pre-trained dialog models for text generation | 1,782
karthikncode/nlp-datasets | A curated list of Natural Language Processing datasets used to train and evaluate NLP models. | 919
nytud/happ | A dataset of Hungarian translations of human-language examples to test anaphora resolution algorithms | 1
kyubyong/css10 | A collection of speech datasets for 10 languages to support text-to-speech tasks | 465
thiagocf05/webnlg | Provides intermediate representations of data for NLG tasks like Discourse Ordering and Lexicalization | 69
toyhom/chinese-medical-dialogue-data | A collection of medical dialogue data for training conversational AI models. | 1,227
rdong08/spatialdwls_dataset | Provides code and data for a spatial decision-making tool | 12
xuefuzhao/instructionwild | Creating a large-scale user-based instruction dataset for natural language processing research and development | 453