Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:LLM-Resistant Math Word Problem Generation via Adversarial Attacks

Feb 27, 2024

Roy Xie, Chengxuan Huang, Junlin Wang, Bhuwan Dhingra

Figure 1 for LLM-Resistant Math Word Problem Generation via Adversarial Attacks

Figure 2 for LLM-Resistant Math Word Problem Generation via Adversarial Attacks

Figure 3 for LLM-Resistant Math Word Problem Generation via Adversarial Attacks

Figure 4 for LLM-Resistant Math Word Problem Generation via Adversarial Attacks

Share this with someone who'll enjoy it:

Abstract:Large language models (LLMs) have significantly transformed the educational landscape. As current plagiarism detection tools struggle to keep pace with LLMs' rapid advancements, the educational community faces the challenge of assessing students' true problem-solving abilities in the presence of LLMs. In this work, we explore a new paradigm for ensuring fair evaluation -- generating adversarial examples which preserve the structure and difficulty of the original questions aimed for assessment, but are unsolvable by LLMs. Focusing on the domain of math word problems, we leverage abstract syntax trees to structurally generate adversarial examples that cause LLMs to produce incorrect answers by simply editing the numeric values in the problems. We conduct experiments on various open- and closed-source LLMs, quantitatively and qualitatively demonstrating that our method significantly degrades their math problem-solving ability. We identify shared vulnerabilities among LLMs and propose a cost-effective approach to attack high-cost models. Additionally, we conduct automatic analysis on math problems and investigate the cause of failure to guide future research on LLM's mathematical capability.

* Code is available at https://github.com/ruoyuxie/adversarial_mwps_generation

View paper on

Share this with someone who'll enjoy it:

Title:LLM-Resistant Math Word Problem Generation via Adversarial Attacks

Paper and Code