Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:MMTM: Multi-Tasking Multi-Decoder Transformer for Math Word Problems

Jun 02, 2022

Keyur Faldu, Amit Sheth, Prashant Kikani, Darshan Patel

Figure 1 for MMTM: Multi-Tasking Multi-Decoder Transformer for Math Word Problems

Figure 2 for MMTM: Multi-Tasking Multi-Decoder Transformer for Math Word Problems

Figure 3 for MMTM: Multi-Tasking Multi-Decoder Transformer for Math Word Problems

Figure 4 for MMTM: Multi-Tasking Multi-Decoder Transformer for Math Word Problems

Share this with someone who'll enjoy it:

Abstract:Recently, quite a few novel neural architectures were derived to solve math word problems by predicting expression trees. These architectures varied from seq2seq models, including encoders leveraging graph relationships combined with tree decoders. These models achieve good performance on various MWPs datasets but perform poorly when applied to an adversarial challenge dataset, SVAMP. We present a novel model MMTM that leverages multi-tasking and multi-decoder during pre-training. It creates variant tasks by deriving labels using pre-order, in-order and post-order traversal of expression trees, and uses task-specific decoders in a multi-tasking framework. We leverage transformer architectures with lower dimensionality and initialize weights from RoBERTa model. MMTM model achieves better mathematical reasoning ability and generalisability, which we demonstrate by outperforming the best state of the art baseline models from Seq2Seq, GTS, and Graph2Tree with a relative improvement of 19.4% on an adversarial challenge dataset SVAMP.

* 10 pages, 3 figures, 3 tables

View paper on

Share this with someone who'll enjoy it:

Title:MMTM: Multi-Tasking Multi-Decoder Transformer for Math Word Problems

Paper and Code