Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:ByGPT5: End-to-End Style-conditioned Poetry Generation with Token-free Language Models

Dec 20, 2022

Jonas Belouadi, Steffen Eger

Figure 1 for ByGPT5: End-to-End Style-conditioned Poetry Generation with Token-free Language Models

Figure 2 for ByGPT5: End-to-End Style-conditioned Poetry Generation with Token-free Language Models

Figure 3 for ByGPT5: End-to-End Style-conditioned Poetry Generation with Token-free Language Models

Figure 4 for ByGPT5: End-to-End Style-conditioned Poetry Generation with Token-free Language Models

Share this with someone who'll enjoy it:

Abstract:State-of-the-art poetry generation systems are often complex. They either consist of task-specific model pipelines, incorporate prior knowledge in the form of manually created constraints or both. In contrast, end-to-end models would not suffer from the overhead of having to model prior knowledge and could learn the nuances of poetry from data alone, reducing the degree of human supervision required. In this work, we investigate end-to-end poetry generation conditioned on styles such as rhyme, meter, and alliteration. We identify and address lack of training data and mismatching tokenization algorithms as possible limitations of past attempts. In particular, we successfully pre-train and release ByGPT5, a new token-free decoder-only language model, and fine-tune it on a large custom corpus of English and German quatrains annotated with our styles. We show that ByGPT5 outperforms other models such as mT5, ByT5, GPT-2 and ChatGPT, while also being more parameter efficient and performing favorably compared to humans. In addition, we analyze its runtime performance and introspect the model's understanding of style conditions. We make our code, models, and datasets publicly available.

* Preprint

View paper on

Share this with someone who'll enjoy it:

Title:ByGPT5: End-to-End Style-conditioned Poetry Generation with Token-free Language Models

Paper and Code