Jonathan Keane

Strategy Masking: A Method for Guardrails in Value-based Reinforcement Learning Agents

Jan 09, 2025

Jonathan Keane, Sam Keyser, Jeremy Kedziora

Abstract: The use of reward functions to structure AI learning and decision making is core to the current reinforcement learning paradigm; however, without careful design of reward functions, agents can learn to solve problems in ways that may be considered "undesirable" or "unethical." Without thorough understanding of the incentives a reward function creates, it can be difficult to impose principled yet general control mechanisms over an agent's behavior. In this paper, we study methods for constructing guardrails for AI agents that use reward functions to learn decision making. We introduce a novel approach, which we call strategy masking, to explicitly learn and then suppress undesirable AI agent behavior. We apply our method to study lying in AI agents and show that strategy masking can effectively modify agent behavior by suppressing, or actively penalizing, the reward dimension for lying such that agents act more honestly while not compromising their ability to perform effectively.
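
To make the idea of suppressing or penalizing a reward dimension concrete, here is a minimal sketch. It is not the authors' implementation; it assumes that the agent keeps one value estimate per reward dimension for each action and that a mask weights those dimensions when an action is chosen. The function name `masked_greedy_action`, the two dimensions (task, lying), and all numbers are hypothetical.

```python
def masked_greedy_action(q_values, mask):
    """Pick the action with the highest mask-weighted value.

    q_values: one row per action, one column per reward dimension
              (assumed decomposition, not taken from the paper).
    mask:     one weight per reward dimension; 1 keeps a dimension,
              0 suppresses it, a negative weight actively penalizes it.
    """
    scores = [sum(m * q for m, q in zip(mask, row)) for row in q_values]
    return max(range(len(scores)), key=scores.__getitem__)


# Hypothetical values: columns are [task reward, lying reward].
q = [
    [1.0, 0.0],  # action 0: honest
    [0.8, 0.5],  # action 1: relies on lying
]

unmasked = masked_greedy_action(q, [1.0, 1.0])     # scores 1.0 vs 1.3
suppressed = masked_greedy_action(q, [1.0, 0.0])   # scores 1.0 vs 0.8
penalized = masked_greedy_action(q, [1.0, -1.0])   # scores 1.0 vs 0.3
print(unmasked, suppressed, penalized)
```

With these illustrative numbers the unmasked agent prefers the lying action, while both the suppressing and the penalizing mask switch the choice to the honest action.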

Fingerspelling recognition in the wild with iterative visual attention

Aug 28, 2019

Bowen Shi, Aurora Martinez Del Rio, Jonathan Keane, Diane Brentari, Greg Shakhnarovich, Karen Livescu

Abstract: Sign language recognition is a challenging gesture sequence recognition problem, characterized by quick and highly coarticulated motion. In this paper we focus on recognition of fingerspelling sequences in American Sign Language (ASL) videos collected in the wild, mainly from YouTube and Deaf social media. Most previous work on sign language recognition has focused on controlled settings where the data is recorded in a studio environment and the number of signers is limited. Our work aims to address the challenges of real-life data, reducing the need for detection or segmentation modules commonly used in this domain. We propose an end-to-end model based on an iterative attention mechanism, without explicit hand detection or segmentation. Our approach dynamically focuses on increasingly high-resolution regions of interest. It outperforms prior work by a large margin. We also introduce a newly collected data set of crowdsourced annotations of fingerspelling in the wild, and show that performance can be further improved with this additional data set.

* ICCV 2019

American Sign Language fingerspelling recognition in the wild

Oct 26, 2018

Bowen Shi, Aurora Martinez Del Rio, Jonathan Keane, Jonathan Michaux, Diane Brentari, Greg Shakhnarovich, Karen Livescu

Abstract: We address the problem of American Sign Language fingerspelling recognition in the wild, using videos collected from websites. We introduce the largest data set available so far for the problem of fingerspelling recognition, and the first using naturally occurring video data. Using this data set, we present the first attempt to recognize fingerspelling sequences in this challenging setting. Unlike prior work, our video data is extremely challenging due to low frame rates and visual variability. To tackle the visual challenges, we train a special-purpose signing hand detector using a small subset of our data. Given the hand detector output, a sequence model decodes the hypothesized fingerspelled letter sequence. For the sequence model, we explore attention-based recurrent encoder-decoders and CTC-based approaches. As the first attempt at fingerspelling recognition in the wild, this work is intended to serve as a baseline for future work on sign language recognition in realistic conditions. We find that, as expected, letter error rates are much higher than in previous work on more controlled data, and we analyze the sources of error and effects of model variants.

Lexicon-Free Fingerspelling Recognition from Video: Data, Models, and Signer Adaptation

Sep 26, 2016

Taehwan Kim, Jonathan Keane, Weiran Wang, Hao Tang, Jason Riggle, Gregory Shakhnarovich, Diane Brentari, Karen Livescu

Abstract: We study the problem of recognizing video sequences of fingerspelled letters in American Sign Language (ASL). Fingerspelling comprises a significant but relatively understudied part of ASL. Recognizing fingerspelling is challenging for a number of reasons: it involves quick, small motions that are often highly coarticulated; it exhibits significant variation between signers; and there has been a dearth of continuous fingerspelling data collected. In this work we collect and annotate a new data set of continuous fingerspelling videos, compare several types of recognizers, and explore the problem of signer variation. Our best-performing models are segmental (semi-Markov) conditional random fields using deep neural network-based features. In the signer-dependent setting, our recognizers achieve up to about 92% letter accuracy. The multi-signer setting is much more challenging, but with neural network adaptation we achieve up to 83% letter accuracy in this setting.

* arXiv admin note: substantial text overlap with arXiv:1608.08339
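
The abstract above reports letter accuracy without defining it. The sketch below assumes the definition commonly used in sequence recognition, namely one minus the edit distance between the reference and hypothesized letter sequences divided by the reference length; the abstract does not confirm that this exact definition was used. The function names and the example strings are illustrative only.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance: minimum substitutions, insertions, deletions."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, 1):
        cur = [i]
        for j, h in enumerate(hyp, 1):
            cur.append(min(
                prev[j] + 1,             # delete a reference letter
                cur[j - 1] + 1,          # insert a hypothesis letter
                prev[j - 1] + (r != h),  # substitute, or match at no cost
            ))
        prev = cur
    return prev[-1]


def letter_accuracy(ref, hyp):
    """Assumed definition: 1 - edit_distance / number of reference letters."""
    return 1.0 - edit_distance(ref, hyp) / len(ref)


# One substituted letter out of four reference letters.
print(letter_accuracy("abcd", "abxd"))
```

Under this definition the letter error rate is the complement of letter accuracy, and accuracy can become negative when the hypothesis contains many insertions.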
