awesome-nlg

A curated list of resources dedicated to Natural Language Generation (NLG)

GitHub

453 stars
18 watching
61 forks
last commit: about 1 year ago
Linked from 3 awesome lists

awesomeawesome-listnatural-language-generationnatural-language-understandingnlg

Awesome Natural Language Generation / Datasets

Alex Context NLG Dataset 23 about 8 years ago A dataset for NLG in dialogue systems in the public transport information domain
Box-score data 110 over 2 years ago This dataset consists of (human-written) NBA basketball game summaries aligned with their corresponding box- and line-scores
E2E This shared task focuses on recent end-to-end (E2E), data-driven NLG methods, which jointly learn sentence planning and surface realisation from non-aligned data
Neural-Wikipedian 10 about 6 years ago The repository contains the code along with the required corpora that were used in order to build a system that "learns" how to generate English biographies for Semantic Web triples
WeatherGov Computer-generated weather forecasts from weather.gov (US public forecast), along with corresponding weather data
WebNLG 69 over 3 years ago The enriched version of the WebNLG - a resource for evaluating common NLG tasks, including Discourse Ordering, Lexicalization and Referring Expression Generation
WikiBio - wikipedia biography dataset This dataset gathers 728,321 biographies from wikipedia. It aims at evaluating text generation algorithms
The Schema-Guided Dialogue Dataset 541 about 1 year ago The Schema-Guided Dialogue (SGD) dataset consists of over 20k annotated multi-domain, task-oriented conversations between a human and a virtual assistant
The Wikipedia company corpus Company descriptions collected from Wikipedia. The dataset contains semantic representations, short, and long descriptions for 51K companies in English
YelpNLG YelpNLG provides resources for natural language generation of restaurant reviews

Awesome Natural Language Generation / Dialog

Chatito 874 about 1 year ago Generate datasets for AI chatbots, NLP tasks, named entity recognition or text classification models using a simple DSL!
NNDIAL 348 over 7 years ago NNDial is an open source toolkit for building end-to-end trainable task-oriented dialogue models
Plato 977 about 4 years ago This is the Plato Research Dialogue System, a flexible platform for developing conversational AI agents
RNNLG 489 over 5 years ago RNNLG is an open source benchmark toolkit for Natural Language Generation (NLG) in spoken dialogue system application domains
TGen 204 almost 3 years ago Statistical NLG for spoken dialogue systems

Awesome Natural Language Generation / Evaluation

BLEURT: a Transfer Learning-Based Metric for Natural Language Generation 686 about 1 year ago
compare-mt 466 over 2 years ago A tool for holistic analysis of language generations systems
GEM a benchmark environment for NLG with a focus on its Evaluation, both through human annotations and automated Metrics
NLG-eval 1,344 about 2 months ago Evaluation code for various unsupervised automated metrics for Natural Language Generation
VizSeq 441 7 days ago A Visual Analysis Toolkit for Text Generation Tasks

Awesome Natural Language Generation / Grammar

OpenCCG 204 over 3 years ago OpenCCG library for parsing and realization with CCG
GrammaticalFramework A programming language for multilingual grammar applications
EasyCCG 60 almost 7 years ago CCG: All combinators, common grammar format, parsing to logical form, parameter estimation for probabilistic CCG
CCG Lab 27 over 1 year ago All combinators, common grammar format, parsing to logical form, parameter estimation for probabilistic CCG
CCGweb 5 almost 2 years ago A Web platform for parsing and annotation

Awesome Natural Language Generation / Libraries

Cron Expression Descriptor 1,008 3 months ago A .NET library that converts cron expressions into human readable descriptions
Number Words 197 over 3 years ago Convert a number to an approximated text expression: from '0.23' to 'less than a quarter'
Writebot A NodeJS library that makes it easier to use GPT-3 by using presets

Awesome Natural Language Generation / Narrative Generation

Random Story Generator 64 over 12 years ago Using Natural Language Generation (NLG) to create a random short story
Tracery 2,122 over 4 years ago A story-grammar generation library for JavaScript

Awesome Natural Language Generation / Neural Natural Language Generation

aitextgen 1,836 about 1 year ago A robust Python tool for text-based AI training and generation using GPT-2
graph-2-text 152 about 5 years ago Graph to sequence implemented in Pytorch combining Graph convolutional networks and opennmt-py
Image Caption Generator 148 almost 5 years ago A Neural Network based generative model for captioning images using Tensorflow
lightnlg 3 over 1 year ago A minimalistic codebase for finetuning and interacting with NLG models using PyTorch Lightning
PaperRobot: Incremental Draft Generation of Scientific Ideas 471 7 months ago We present a PaperRobot who performs as an automatic research assistant
PPLM 1,130 8 months ago Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models
Question Generation using hugstransformers 1,097 6 months ago Question generation is the task of automatically generating questions from a text paragraph
Texar 2,388 about 3 years ago Texar is a toolkit aiming to support a broad set of machine learning, especially natural language processing and text generation tasks
textgenrnn 4,939 about 2 years ago Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code
This Word Does Not Exist 1,020 over 2 years ago This is a project allows people to train a variant of GPT-2 that makes up words, definitions and examples from scratch
Transformers 132,640 8 days ago State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch
Summary Generation From Structured Data 183 over 5 years ago For converting information present in the form of structured data into natural language text

Awesome Natural Language Generation / Papers and Articles

2022: Repairing the Cracked Foundation: A Survey of Obstacles in Evaluation Practices for Generated Text
2021: Vision: NLG Can Help Humanise Data and AI
2020: The Curious Case of Neural Text Degeneration
2020: A Gold Standard Methodology for Evaluating Accuracy in Data-To-Text Systems
2020: Evaluating the state-of-the-art of End-to-End Natural Language Generation: The E2E NLG challenge
2020: How to generate text: using different decoding methods for language generation with Transformers
2020: Natural language generation: The commercial state ofthe art in 2020
2020: Turing-NLG: A 17-billion-parameter language model by Microsoft
2019: A Closer Look at Recent Results of Verb Selection for Data-to-Text NLG
2019: A Personalized Data-to-Text Support Tool for Cancer Patients
2019: Controlling Contents in Data-to-Document Generation with Human-Designed Topic Labels
2019: Generated Texts Must Be Accurate!
2019: Hotel Scribe: Generating High Variation Hotel Descriptions
2019: Revisiting Challenges in Data-to-Text Generation with Fact Grounding
2017: Survey of the State of the Art in NaturalLanguage Generation: Core tasks, applicationsand evaluation
2016: Natural Language Generation enhances human decision-making with uncertain information

Awesome Natural Language Generation / Products

Accelerated Text 796 over 1 year ago Automatically generate multiple natural language descriptions of your data varying in wording and structure
RosaeNLG An open-source library for node.js or client side (browser) execution, based on the Pug template engine, to generate texts in English, French, German and Italian
Twine An open-source tool for telling interactive, nonlinear stories

Awesome Natural Language Generation / Realizers

Genl 21 over 9 years ago Surface realiser (part of a Natural Language Generation system) using Tree Adjoining Grammar
JSrealB 23 14 days ago A JavaScript bilingual text realizer for web development
SimpleNLG 810 over 1 year ago Java API for Natural Language Generation
SimpleNLG DE 18 over 1 year ago German version of SimpleNLG 4
SimpleNLG-EnFr 25 over 6 years ago SimpleNLG-EnFr 1.1 is a bilingual English/French adaption of SimpleNLG v4.2

Awesome Natural Language Generation / Templating Languages

calyx 62 almost 2 years ago A Ruby library for generating text with recursive template grammars
nalgene 55 over 5 years ago Natural language generation language
StringTemplate Java template engine (with ports for C##, Objective-C, JavaScript, Scala) for generating source code, web pages, emails, or any other formatted text output

Awesome Natural Language Generation / Videos

Data-To-Text: Generating Textual Summaries of Complex Data - Ehud Reiter
Imitation Learning and its Application to Natural Language Generation
Natural Language Generation (Introduction)
Strata Data Conference | The future of natural language generation: 2017-2027
The Quest for Automated Story Generation - Mark Riedl

Backlinks from these awesome lists: