Kai-Ling Lo

GPoeT-2: A GPT-2 Based Poem Generator

May 18, 2022

Kai-Ling Lo, Rami Ariss, Philipp Kurz

Abstract: This project aims to produce the next volume of machine-generated poetry, a complex art form that can be structured or unstructured and that carries depth of meaning between the lines. GPoeT-2 is based on fine-tuning a state-of-the-art natural language model (i.e., GPT-2) to generate limericks, typically humorous structured poems consisting of five lines with an AABBA rhyming scheme. With a two-stage generation system utilizing both forward and reverse language modeling, GPoeT-2 is capable of freely generating limericks on diverse topics while following the rhyming structure without any seed phrase or a posteriori constraints. Based on the automated generation process, we explore a wide variety of evaluation metrics to quantify "good poetry," including syntactical correctness, lexical diversity, and subject continuity. Finally, we present a collection of 94 categorized limericks that rank highly on the explored "good poetry" metrics to provoke human creativity.

* Carnegie Mellon University 11-785: Intro to Deep Learning Final Project
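
The abstract above lists lexical diversity among the "good poetry" metrics but does not say how it is computed. As a rough illustration only, the sketch below uses the type-token ratio (unique words divided by total words), one common proxy for lexical diversity. The function name `type_token_ratio`, the tokenization rule, and the sample text are assumptions made for this example, not details taken from the paper.

```python
import re


def type_token_ratio(text: str) -> float:
    """Return unique words / total words, or 0.0 for text with no words.

    Assumption: this is one common lexical-diversity proxy, not
    necessarily the metric used by the GPoeT-2 authors.
    """
    # Lowercase the text and keep runs of letters and apostrophes as tokens.
    tokens = re.findall(r"[a-z']+", text.lower())
    if not tokens:
        return 0.0
    return len(set(tokens)) / len(tokens)


# Made-up sample text: 4 unique words out of 5 total.
print(type_token_ratio("the cat saw the dog"))  # 0.8
```

A higher ratio means fewer repeated words. The ratio tends to fall as texts get longer, so it is most meaningful when comparing texts of similar length, such as five-line limericks.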

Knowledge-Grounded Response Generation with Deep Attentional Latent-Variable Model

Mar 23, 2019

Hao-Tong Ye, Kai-Ling Lo, Shang-Yu Su, Yun-Nung Chen

Abstract: End-to-end dialogue generation has achieved promising results without using handcrafted features and attributes specific to each task and corpus. However, a major drawback of such approaches is that they are unable to generate informative utterances, which limits their use in some real-world conversational applications. This paper attempts to generate diverse and informative responses with a variational generation model, which contains a joint attention mechanism conditioned on information from both dialogue contexts and extra knowledge.

* Published in DSTC7 workshop at AAAI 2019

Natural Language Generation by Hierarchical Decoding with Linguistic Patterns

Aug 09, 2018

Shang-Yu Su, Kai-Ling Lo, Yi-Ting Yeh, Yun-Nung Chen

Abstract: Natural language generation (NLG) is a critical component in spoken dialogue systems. Classic NLG can be divided into two phases: (1) sentence planning, which decides on the overall sentence structure, and (2) surface realization, which determines specific word forms and flattens the sentence structure into a string. Many simple NLG models are based on recurrent neural networks (RNNs) and sequence-to-sequence (seq2seq) models, which use an encoder-decoder structure; these NLG models generate sentences from scratch by jointly optimizing sentence planning and surface realization using a simple cross-entropy loss training criterion. However, the simple encoder-decoder architecture usually struggles to generate complex and long sentences, because the decoder has to learn all grammar and diction knowledge. This paper introduces a hierarchical decoding NLG model based on linguistic patterns at different levels, and shows that the proposed method outperforms the traditional one with a smaller model size. Furthermore, the design of the hierarchical decoding is flexible and easily extensible to various NLG systems.

* Published in NAACL-HLT 2018; the first two authors contributed equally
