Abstract: Pre-trained language models (PLMs) fail to generate long-form narrative text because they do not consider global structure. As a result, the generated texts are often incoherent, repetitive, or lack content. Recent work in story generation reintroduced explicit content planning in the form of prompts, keywords, or semantic frames. Trained on large parallel corpora, these models can generate more logical event sequences and thus more contentful stories. However, these intermediate representations are often not in natural language and cannot be utilized by PLMs without fine-tuning. We propose generating story plots using off-the-shelf PLMs while retaining the benefit of content planning, to generate cohesive and contentful stories. Our proposed method, ScratchPlot, first prompts a PLM to compose a content plan. Then, we generate the story's body and ending conditioned on the content plan. Furthermore, we take a generate-and-rank approach, using additional PLMs to rank the generated (story, ending) pairs. We benchmark our method against various baselines and achieve superior results in both human and automatic evaluation.
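To make the prompt-plan-generate-rank pipeline concrete, below is a minimal sketch using Hugging Face Transformers. It is not the authors' released implementation: the model choice (GPT-2), the prompts, and the perplexity-based ranking proxy are all illustrative assumptions.

```python
# Hedged sketch of a prompt -> plan -> generate -> rank pipeline.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline

generator = pipeline("text-generation", model="gpt2")

def complete(prompt, n=1, max_new_tokens=80):
    outs = generator(prompt, max_new_tokens=max_new_tokens,
                     num_return_sequences=n, do_sample=True)
    return [o["generated_text"][len(prompt):].strip() for o in outs]

# Step 1: prompt the PLM to compose a content plan (here, a premise).
plan = complete("Write a one-sentence story premise:")[0]

# Step 2: generate candidate story bodies/endings conditioned on the plan.
candidates = [f"Premise: {plan}\nStory: " + c
              for c in complete(f"Premise: {plan}\nStory:",
                                n=4, max_new_tokens=150)]

# Step 3: rank candidates with another PLM; language-model loss (a
# perplexity proxy) stands in for the paper's learned rankers.
tok = AutoTokenizer.from_pretrained("gpt2")
lm = AutoModelForCausalLM.from_pretrained("gpt2")

def lm_loss(text):
    ids = tok(text, return_tensors="pt", truncation=True).input_ids
    with torch.no_grad():
        return lm(input_ids=ids, labels=ids).loss.item()

print(min(candidates, key=lm_loss))  # best-ranked (story, ending)
```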
Abstract: Weakly-supervised text classification aims to induce text classifiers from only a few user-provided seed words. The vast majority of previous work assumes that high-quality seed words are given. However, expert-annotated seed words are sometimes non-trivial to come up with. Furthermore, in the weakly-supervised setting, we do not have any labeled documents with which to measure the seed words' efficacy, making seed word selection "a walk in the dark". In this work, we remove the need for expert-curated seed words by first mining (noisy) candidate seed words associated with the category names. We then train interim models with individual candidate seed words. Lastly, we estimate the interim models' error rates in an unsupervised manner. The seed words that yield the lowest estimated error rates are added to the final seed word set. A comprehensive evaluation on six binary classification tasks across four popular datasets demonstrates that the proposed method outperforms a baseline using only category-name seed words and achieves performance comparable to a counterpart using expert-annotated seed words.
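As a schematic of the mine-train-estimate loop, here is a hedged Python sketch with scikit-learn. The pseudo-labeling rule, the confidence-based error proxy, and all helper names are illustrative stand-ins; the paper's actual unsupervised error estimator is not reproduced here.

```python
# Schematic candidate-seed-word selection loop (illustrative only).
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression

def interim_error(docs, seed_words):
    # Pseudo-label: positive if the document mentions any seed word.
    y = [int(any(w in d.lower() for w in seed_words)) for d in docs]
    if len(set(y)) < 2:
        return 1.0  # degenerate labeling; treat as unusable
    X = TfidfVectorizer().fit_transform(docs)
    clf = LogisticRegression(max_iter=1000).fit(X, y)
    # Placeholder for the paper's unsupervised error estimator; low mean
    # prediction confidence serves as a crude stand-in here.
    return 1.0 - clf.predict_proba(X).max(axis=1).mean()

def select_seeds(docs, category_name, candidates, k=5):
    # Keep the k candidates whose interim models look most reliable.
    scored = sorted((interim_error(docs, [category_name, w]), w)
                    for w in candidates)
    return [category_name] + [w for _, w in scored[:k]]
```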
Abstract: Contextual advertising provides advertisers with the opportunity to target the context that is most relevant to their ads. However, its power cannot be fully realized unless we can target page content using fine-grained categories, e.g., "coupe" vs. "hatchback" instead of "automotive" vs. "sport". The widely used advertising content taxonomy (IAB taxonomy) consists of 23 coarse-grained and 355 fine-grained categories. With such a large number of categories, it becomes very challenging either to collect training documents for a supervised classification model or to compose expert-written rules for a rule-based classification system. Moreover, in fine-grained classification, different categories often overlap or co-occur, making accurate classification harder. In this work, we propose wiki2cat, a method that tackles large-scale fine-grained text classification by tapping into the Wikipedia category graph. The categories in the IAB taxonomy are first mapped to category nodes in the graph. Labels are then propagated across the graph to obtain labeled Wikipedia documents from which text classifiers are induced. The method is ideal for large-scale classification problems since it requires neither manually-labeled documents nor hand-curated rules or keywords. The proposed method is benchmarked against various learning-based and keyword-based baselines and yields competitive performance on both publicly available datasets and a new dataset containing more than 300 fine-grained categories.
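The propagation step can be pictured with a small graph-traversal sketch. It assumes the IAB-to-Wikipedia mapping has already been done; the graph representation, depth cut-off, and function names are illustrative, not the paper's exact algorithm.

```python
# Hedged sketch: propagate IAB labels down the Wikipedia category graph.
from collections import deque

def propagate_labels(children, seed_nodes, max_depth=3):
    """BFS from each mapped seed category. `children` maps a category
    node to its subcategories and member articles; `seed_nodes` maps an
    IAB label to the Wikipedia category it was manually mapped to."""
    labels = {}
    for label, root in seed_nodes.items():
        queue, seen = deque([(root, 0)]), {root}
        while queue:
            node, depth = queue.popleft()
            labels.setdefault(node, set()).add(label)
            if depth < max_depth:
                for child in children.get(node, ()):
                    if child not in seen:
                        seen.add(child)
                        queue.append((child, depth + 1))
    return labels  # node -> set of IAB labels; labeled articles
                   # then serve as training documents

# e.g. seed_nodes = {"Coupe": "Category:Coupes"} after manual mapping
```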
Abstract: Previous work in slogan generation focused on generating novel slogans by utilising templates mined from real slogans. While some such slogans can be catchy, they are often incoherent with the company's focus or inconsistent with its style across marketing communications, because the templates are mined from other companies' slogans. We propose a sequence-to-sequence transformer model that generates slogans from a brief company description. A naive sequence-to-sequence model fine-tuned for slogan generation is prone to introducing false information, especially unrelated company names that appear in the training data. We use delexicalisation to address this problem and improve the generated slogans' quality by a large margin. Furthermore, we apply two simple but effective approaches to generate more diverse slogans. Firstly, we train a slogan generator conditioned on the industry; at inference time, changing the industry yields different "flavours" of slogans. Secondly, instead of using only the company description as the input sequence, we sample random paragraphs from the company's website. Surprisingly, the model can generate meaningful slogans even when the input sequence does not resemble a company description. We validate the effectiveness of the proposed method with both quantitative and qualitative evaluation. Our best model achieved ROUGE-1/-2/-L F1 scores of 53.13/33.30/46.49. In addition, human evaluators assigned the generated slogans an average score of 3.39 on a scale of 1-5, indicating that the system can generate plausible slogans with quality close to that of human-written ones (average score 3.55).
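The delexicalisation step itself is simple to illustrate: mask the company name with a placeholder before the model sees the text, then restore it afterwards. The placeholder token and regex handling below are simplified assumptions, not the paper's exact preprocessing.

```python
# Hedged sketch of delexicalisation for slogan generation.
import re

PLACEHOLDER = "<company>"  # illustrative special token

def delexicalise(text, company_name):
    # Mask the company name so the model cannot copy unrelated names
    # memorized from other companies in the training data.
    pattern = re.compile(re.escape(company_name), re.IGNORECASE)
    return pattern.sub(PLACEHOLDER, text)

def relexicalise(text, company_name):
    return text.replace(PLACEHOLDER, company_name)

desc = "Acme Robotics builds affordable warehouse robots."
masked = delexicalise(desc, "Acme Robotics")
# masked -> "<company> builds affordable warehouse robots."
# slogan = seq2seq_model.generate(masked)   # fine-tuned transformer
# print(relexicalise(slogan, "Acme Robotics"))
```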
Abstract: We consider a document classification problem where document labels are absent and only relevant keywords of a target class, together with unlabeled documents, are given. Although heuristic methods based on pseudo-labeling have been considered, theoretical understanding of this problem remains limited. Moreover, previous methods cannot easily incorporate well-developed techniques from supervised text classification. In this paper, we propose a theoretically guaranteed learning framework that is simple to implement and offers flexible choices of models, e.g., linear models or neural networks. We demonstrate how to effectively optimize the area under the receiver operating characteristic curve (AUC) and also discuss how to adjust the framework to optimize other well-known evaluation metrics such as accuracy and the F1-measure. Finally, we show the effectiveness of our framework on benchmark datasets.
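To illustrate what optimizing AUC looks like in this setting, here is a hedged PyTorch sketch with a linear scorer: keyword-matching documents act as pseudo-positives, the remaining unlabeled documents as the contrast set, and a pairwise logistic surrogate of 1 - AUC is minimized. The surrogate, features, and hyperparameters are illustrative, not the paper's exact objective.

```python
# Hedged sketch: pairwise AUC surrogate with a linear model.
import torch

def auc_surrogate_loss(scores_pos, scores_unl):
    # AUC = P(score_pos > score_unl); penalize pairs where an
    # unlabeled document outscores a pseudo-positive one.
    diff = scores_pos.unsqueeze(1) - scores_unl.unsqueeze(0)
    return torch.nn.functional.softplus(-diff).mean()

d = 1000                     # vocabulary size (illustrative)
w = torch.zeros(d, requires_grad=True)
opt = torch.optim.Adam([w], lr=0.1)

X_pos = torch.rand(32, d)    # keyword-matching docs (stand-in features)
X_unl = torch.rand(128, d)   # unlabeled docs

for _ in range(100):
    loss = auc_surrogate_loss(X_pos @ w, X_unl @ w)
    opt.zero_grad()
    loss.backward()
    opt.step()
```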
Abstract: Music genre classification is a widely researched topic in music information retrieval (MIR). The ability to automatically tag genres benefits music streaming service providers such as JOOX, Apple Music, and Spotify in content-based recommendation. However, most studies on music classification have been conducted on Western songs, which differ from Thai songs. Lukthung, a distinctive and long-established type of Thai music, is one of the most popular music genres in Thailand and has a specific group of listeners. In this paper, we develop neural networks that classify the Lukthung genre against other genres using both lyrics and audio. Words used in Lukthung songs are particularly poetical, and their musical style is uniquely composed of traditional Thai instruments. We leverage these two main characteristics by building a lyrics model based on bag-of-words (BoW) and an audio model using a convolutional neural network (CNN) architecture. We then aggregate the intermediate features learned from both models to build a final classifier. Our results show that the three proposed models outperform all of the standard classifiers, with the combined model yielding the best $F_1$ score of 0.86, making Lukthung classification applicable to personalized recommendation for Thai audiences.
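A minimal PyTorch sketch of the late-fusion architecture is shown below: a BoW lyrics branch, a small CNN over an (assumed) mel-spectrogram audio input, and a classifier over the concatenated intermediate features. Layer sizes and the spectrogram shape are illustrative, not the paper's exact configuration.

```python
# Hedged sketch: fuse BoW lyrics features with CNN audio features.
import torch
import torch.nn as nn

class LukthungClassifier(nn.Module):
    def __init__(self, vocab_size=5000):
        super().__init__()
        self.lyrics = nn.Sequential(          # BoW branch
            nn.Linear(vocab_size, 128), nn.ReLU())
        self.audio = nn.Sequential(           # CNN branch on spectrogram
            nn.Conv2d(1, 16, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(16, 32, 3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten())
        self.head = nn.Linear(128 + 32, 1)    # classifier on fused features

    def forward(self, bow, spec):
        z = torch.cat([self.lyrics(bow), self.audio(spec)], dim=-1)
        return self.head(z)                   # logit: Lukthung vs. other

model = LukthungClassifier()
# batch of 4 songs: BoW vectors + 1-channel 128x430 spectrograms
logit = model(torch.rand(4, 5000), torch.rand(4, 1, 128, 430))
```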