Bhargav Upadhyay

Efficient Reinforcement Learning for Unsupervised Controlled Text Generation

Apr 16, 2022

Bhargav Upadhyay, Akhilesh Sudhakar, Arjun Maheswaran

Abstract: Controlled text generation tasks such as unsupervised text style transfer have increasingly adopted Reinforcement Learning (RL). A major challenge in applying RL to such tasks is the sparse reward, which is available only after the full text is generated. Sparse rewards, combined with a large action space, make RL training sample-inefficient and difficult to converge. Recently proposed reward-shaping strategies to address this issue have shown only negligible gains. In contrast, this work proposes a novel approach that provides dense rewards to each generated token. We evaluate our approach on unsupervised text style transfer. Averaged across datasets, our style transfer system improves upon current state-of-the-art systems by 21% on human evaluation and 12% on automatic evaluation. In an ablation against the current reward-shaping approach (the 'roll-out strategy'), using dense rewards improves the overall style transfer quality by 22% based on human evaluation. Further, the RL training is 2.5 times as sample-efficient and 7 times faster.

* 10 pages, 2 figures, 4 tables

Transforming Delete, Retrieve, Generate Approach for Controlled Text Style Transfer

Aug 25, 2019

Akhilesh Sudhakar, Bhargav Upadhyay, Arjun Maheswaran

Abstract: Text style transfer is the task of transferring the style of text having certain stylistic attributes, while preserving non-stylistic or content information. In this work we introduce the Generative Style Transformer (GST), a new approach to rewriting sentences to a target style in the absence of parallel style corpora. GST leverages the power of both large unsupervised pre-trained language models and the Transformer. GST is part of a larger 'Delete Retrieve Generate' framework, in which we also propose a novel method of deleting style attributes from the source sentence by exploiting the inner workings of the Transformer. Our models outperform state-of-the-art systems across 5 datasets on sentiment, gender and political slant transfer. We also propose the use of the GLEU metric as an automatic metric for evaluating style transfer, which we found to compare better with human ratings than the predominantly used BLEU score.

* 11 pages, 6 tables, 2 figures. Accepted at the 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP 2019)
