Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Are You Sure? Challenging LLMs Leads to Performance Drops in The FlipFlop Experiment

Nov 14, 2023

Philippe Laban, Lidiya Murakhovs'ka, Caiming Xiong, Chien-Sheng Wu

Figure 1 for Are You Sure? Challenging LLMs Leads to Performance Drops in The FlipFlop Experiment

Figure 2 for Are You Sure? Challenging LLMs Leads to Performance Drops in The FlipFlop Experiment

Figure 3 for Are You Sure? Challenging LLMs Leads to Performance Drops in The FlipFlop Experiment

Figure 4 for Are You Sure? Challenging LLMs Leads to Performance Drops in The FlipFlop Experiment

Share this with someone who'll enjoy it:

Abstract:The interactive nature of Large Language Models (LLMs) theoretically allows models to refine and improve their answers, yet systematic analysis of the multi-turn behavior of LLMs remains limited. In this paper, we propose the FlipFlop experiment: in the first round of the conversation, an LLM responds to a prompt containing a classification task. In a second round, the LLM is challenged with a follow-up phrase like "Are you sure?", offering an opportunity for the model to reflect on its initial answer, and decide whether to confirm or flip its answer. A systematic study of nine LLMs on seven classification tasks reveals that models flip their answers on average 46% of the time and that all models see a deterioration of accuracy between their first and final prediction, with an average drop of 17%. The FlipFlop experiment illustrates the universality of sycophantic behavior in LLMs and provides a robust framework to analyze model behavior and evaluate potential solutions.

View paper on

Share this with someone who'll enjoy it:

Title:Are You Sure? Challenging LLMs Leads to Performance Drops in The FlipFlop Experiment

Paper and Code