Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Lis Pereira

Targeted Adversarial Training for Natural Language Understanding

Apr 12, 2021

Lis Pereira, Xiaodong Liu, Hao Cheng, Hoifung Poon, Jianfeng Gao, Ichiro Kobayashi

Figure 1 for Targeted Adversarial Training for Natural Language Understanding

Figure 2 for Targeted Adversarial Training for Natural Language Understanding

Figure 3 for Targeted Adversarial Training for Natural Language Understanding

Figure 4 for Targeted Adversarial Training for Natural Language Understanding

Abstract:We present a simple yet effective Targeted Adversarial Training (TAT) algorithm to improve adversarial training for natural language understanding. The key idea is to introspect current mistakes and prioritize adversarial training steps to where the model errs the most. Experiments show that TAT can significantly improve accuracy over standard adversarial training on GLUE and attain new state-of-the-art zero-shot results on XNLI. Our code will be released at: https://github.com/namisan/mt-dnn.

* 9 pages, 4 tables, 3 figurers, NAACL 2021

Via

Access Paper or Ask Questions

Posterior Differential Regularization with f-divergence for Improving Model Robustness

Oct 23, 2020

Hao Cheng, Xiaodong Liu, Lis Pereira, Yaoliang Yu, Jianfeng Gao

Figure 1 for Posterior Differential Regularization with f-divergence for Improving Model Robustness

Figure 2 for Posterior Differential Regularization with f-divergence for Improving Model Robustness

Figure 3 for Posterior Differential Regularization with f-divergence for Improving Model Robustness

Figure 4 for Posterior Differential Regularization with f-divergence for Improving Model Robustness

Abstract:We address the problem of enhancing model robustness through regularization. Specifically, we focus on methods that regularize the model posterior difference between clean and noisy inputs. Theoretically, we provide a connection of two recent methods, Jacobian Regularization and Virtual Adversarial Training, under this framework. Additionally, we generalize the posterior differential regularization to the family of $f$-divergences and characterize the overall regularization framework in terms of Jacobian matrix. Empirically, we systematically compare those regularizations and standard BERT training on a diverse set of tasks to provide a comprehensive profile of their effect on model in-domain and out-of-domain generalization. For both fully supervised and semi-supervised settings, our experiments show that regularizing the posterior differential with $f$-divergence can result in well-improved model robustness. In particular, with a proper $f$-divergence, a BERT-base model can achieve comparable generalization as its BERT-large counterpart for in-domain, adversarial and domain shift scenarios, indicating the great potential of the proposed framework for boosting model generalization for NLP models.

Via

Access Paper or Ask Questions

Adversarial Training for Commonsense Inference

May 17, 2020

Lis Pereira, Xiaodong Liu, Fei Cheng, Masayuki Asahara, Ichiro Kobayashi

Figure 1 for Adversarial Training for Commonsense Inference

Figure 2 for Adversarial Training for Commonsense Inference

Figure 3 for Adversarial Training for Commonsense Inference

Abstract:We propose an AdversariaL training algorithm for commonsense InferenCE (ALICE). We apply small perturbations to word embeddings and minimize the resultant adversarial risk to regularize the model. We exploit a novel combination of two different approaches to estimate these perturbations: 1) using the true label and 2) using the model prediction. Without relying on any human-crafted features, knowledge bases, or additional datasets other than the target datasets, our model boosts the fine-tuning performance of RoBERTa, achieving competitive results on multiple reading comprehension datasets that require commonsense inference.

* ACL2020 RepL4NLP workshop
* 6 pages, Accepted to ACL2020 RepL4NLP workshop

Via

Access Paper or Ask Questions