Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Increasing Students' Engagement to Reminder Emails Through Multi-Armed Bandits

Aug 10, 2022

Fernando J. Yanez, Angela Zavaleta-Bernuy, Ziwen Han, Michael Liut, Anna Rafferty, Joseph Jay Williams

Figure 1 for Increasing Students' Engagement to Reminder Emails Through Multi-Armed Bandits

Figure 2 for Increasing Students' Engagement to Reminder Emails Through Multi-Armed Bandits

Figure 3 for Increasing Students' Engagement to Reminder Emails Through Multi-Armed Bandits

Figure 4 for Increasing Students' Engagement to Reminder Emails Through Multi-Armed Bandits

Share this with someone who'll enjoy it:

Abstract:Conducting randomized experiments in education settings raises the question of how we can use machine learning techniques to improve educational interventions. Using Multi-Armed Bandits (MAB) algorithms like Thompson Sampling (TS) in adaptive experiments can increase students' chances of obtaining better outcomes by increasing the probability of assignment to the most optimal condition (arm), even before an intervention completes. This is an advantage over traditional A/B testing, which may allocate an equal number of students to both optimal and non-optimal conditions. The problem is the exploration-exploitation trade-off. Even though adaptive policies aim to collect enough information to allocate more students to better arms reliably, past work shows that this may not be enough exploration to draw reliable conclusions about whether arms differ. Hence, it is of interest to provide additional uniform random (UR) exploration throughout the experiment. This paper shows a real-world adaptive experiment on how students engage with instructors' weekly email reminders to build their time management habits. Our metric of interest is open email rates which tracks the arms represented by different subject lines. These are delivered following different allocation algorithms: UR, TS, and what we identified as TS{\dag} - which combines both TS and UR rewards to update its priors. We highlight problems with these adaptive algorithms - such as possible exploitation of an arm when there is no significant difference - and address their causes and consequences. Future directions includes studying situations where the early choice of the optimal arm is not ideal and how adaptive algorithms can address them.

* 6th Educational Data Mining in Computer Science Education (CSEDM) Workshop In conjunction with EDM 2022

View paper on

Share this with someone who'll enjoy it:

Title:Increasing Students' Engagement to Reminder Emails Through Multi-Armed Bandits

Paper and Code