Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:The Microsoft Toolkit of Multi-Task Deep Neural Networks for Natural Language Understanding

Feb 19, 2020

Xiaodong Liu, Yu Wang, Jianshu Ji, Hao Cheng, Xueyun Zhu, Emmanuel Awa, Pengcheng He, Weizhu Chen, Hoifung Poon, Guihong Cao(+1 more)

Figure 1 for The Microsoft Toolkit of Multi-Task Deep Neural Networks for Natural Language Understanding

Figure 2 for The Microsoft Toolkit of Multi-Task Deep Neural Networks for Natural Language Understanding

Figure 3 for The Microsoft Toolkit of Multi-Task Deep Neural Networks for Natural Language Understanding

Figure 4 for The Microsoft Toolkit of Multi-Task Deep Neural Networks for Natural Language Understanding

Share this with someone who'll enjoy it:

Abstract:We present MT-DNN, an open-source natural language understanding (NLU) toolkit that makes it easy for researchers and developers to train customized deep learning models. Built upon PyTorch and Transformers, MT-DNN is designed to facilitate rapid customization for a broad spectrum of NLU tasks, using a variety of objectives (classification, regression, structured prediction) and text encoders (e.g., RNNs, BERT, RoBERTa, UniLM). A unique feature of MT-DNN is its built-in support for robust and transferable learning using the adversarial multi-task learning paradigm. To enable efficient production deployment, MT-DNN supports multi-task knowledge distillation, which can substantially compress a deep neural model without significant performance drop. We demonstrate the effectiveness of MT-DNN on a wide range of NLU applications across general and biomedical domains. The software and pre-trained models will be publicly available at https://github.com/namisan/mt-dnn.

* 9 pages, 3 figures and 3 tables

View paper on

Share this with someone who'll enjoy it:

Title:The Microsoft Toolkit of Multi-Task Deep Neural Networks for Natural Language Understanding

Paper and Code