alex_context_nlg_dataset
Dialogue Context Dataset
A dataset for training natural language generation models in dialogue systems by incorporating context information.
Dataset for NLG which contains preceding context along with each generation instance
23 stars
4 watching
13 forks
last commit: about 8 years ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
ufal-dsg/tgen | A statistical natural language generator for spoken dialogue systems using two different algorithms to produce human-like responses | 205 |
google-research-datasets/dstc8-schema-guided-dialogue | A large-scale dialogue dataset designed to support the development of virtual assistants and dialogue systems, with a focus on task-oriented conversations and linguistic variation. | 549 |
shawnwun/rnnlg | A toolkit for benchmarking Natural Language Generation in Spoken Dialogue Systems | 491 |
bwilcox-1234/chatscript | A natural language processing tool for creating conversational dialogue systems | 17 |
tjunlp-lab/shallow-discourse-annotation-for-chinese-ted-talks | Provides annotated data and tools for annotating Chinese TED Talks with discourse-level properties. | 8 |
nytud/hulu | A collection of linguistic datasets and benchmarks for natural language understanding tasks | 9 |
candlewill/dialog_corpus | A collection of datasets used to train and improve chatbot systems in both English and Chinese. | 2,033 |
thu-coai/cdial-gpt | A large-scale Chinese conversation dataset and pre-trained dialog models for text generation | 1,782 |
karthikncode/nlp-datasets | A curated list of Natural Language Processing datasets used to train and evaluate NLP models. | 919 |
nytud/happ | A dataset of Hungarian translations of human-language examples to test anaphora resolution algorithms | 1 |
kyubyong/css10 | A collection of speech datasets for 10 languages to support text-to-speech tasks | 465 |
thiagocf05/webnlg | Provides intermediate representations of data for NLG tasks like Discourse Ordering and Lexicalization | 69 |
toyhom/chinese-medical-dialogue-data | A collection of medical dialogue data for training conversational AI models. | 1,227 |
rdong08/spatialdwls_dataset | Provides code and data for a spatial decision-making tool | 12 |
xuefuzhao/instructionwild | Creating a large-scale user-based instruction dataset for natural language processing research and development | 453 |