Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Shudong Hao

Understanding Crosslingual Transfer Mechanisms in Probabilistic Topic Modeling

Oct 13, 2018

Shudong Hao, Michael J. Paul

Figure 1 for Understanding Crosslingual Transfer Mechanisms in Probabilistic Topic Modeling

Figure 2 for Understanding Crosslingual Transfer Mechanisms in Probabilistic Topic Modeling

Figure 3 for Understanding Crosslingual Transfer Mechanisms in Probabilistic Topic Modeling

Figure 4 for Understanding Crosslingual Transfer Mechanisms in Probabilistic Topic Modeling

Abstract:Probabilistic topic modeling is a popular choice as the first step of crosslingual tasks to enable knowledge transfer and extract multilingual features. While many multilingual topic models have been developed, their assumptions on the training corpus are quite varied, and it is not clear how well the models can be applied under various training conditions. In this paper, we systematically study the knowledge transfer mechanisms behind different multilingual topic models, and through a broad set of experiments with four models on ten languages, we provide empirical insights that can inform the selection and future development of multilingual topic models.

Via

Access Paper or Ask Questions

Learning Multilingual Topics from Incomparable Corpus

Jun 11, 2018

Shudong Hao, Michael J. Paul

Figure 1 for Learning Multilingual Topics from Incomparable Corpus

Figure 2 for Learning Multilingual Topics from Incomparable Corpus

Figure 3 for Learning Multilingual Topics from Incomparable Corpus

Figure 4 for Learning Multilingual Topics from Incomparable Corpus

Abstract:Multilingual topic models enable crosslingual tasks by extracting consistent topics from multilingual corpora. Most models require parallel or comparable training corpora, which limits their ability to generalize. In this paper, we first demystify the knowledge transfer mechanism behind multilingual topic models by defining an alternative but equivalent formulation. Based on this analysis, we then relax the assumption of training data required by most existing models, creating a model that only requires a dictionary for training. Experiments show that our new method effectively learns coherent multilingual topics from partially and fully incomparable corpora with limited amounts of dictionary resources.

* To appear in International Conference on Computational Linguistics (COLING), 2018

Via

Access Paper or Ask Questions

Lessons from the Bible on Modern Topics: Low-Resource Multilingual Topic Model Evaluation

Apr 26, 2018

Shudong Hao, Jordan Boyd-Graber, Michael J. Paul

Figure 1 for Lessons from the Bible on Modern Topics: Low-Resource Multilingual Topic Model Evaluation

Figure 2 for Lessons from the Bible on Modern Topics: Low-Resource Multilingual Topic Model Evaluation

Figure 3 for Lessons from the Bible on Modern Topics: Low-Resource Multilingual Topic Model Evaluation

Figure 4 for Lessons from the Bible on Modern Topics: Low-Resource Multilingual Topic Model Evaluation

Abstract:Multilingual topic models enable document analysis across languages through coherent multilingual summaries of the data. However, there is no standard and effective metric to evaluate the quality of multilingual topics. We introduce a new intrinsic evaluation of multilingual topic models that correlates well with human judgments of multilingual topic coherence as well as performance in downstream applications. Importantly, we also study evaluation for low-resource languages. Because standard metrics fail to accurately measure topic quality when robust external resources are unavailable, we propose an adaptation model that improves the accuracy and reliability of these metrics in low-resource settings.

* North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT), New Orleans, Louisiana. June 2018

Via

Access Paper or Ask Questions