Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yannik Keller

From Images to Connections: Can DQN with GNNs learn the Strategic Game of Hex?

Nov 22, 2023

Yannik Keller, Jannis Blüml, Gopika Sudhakaran, Kristian Kersting

Figure 1 for From Images to Connections: Can DQN with GNNs learn the Strategic Game of Hex?

Figure 2 for From Images to Connections: Can DQN with GNNs learn the Strategic Game of Hex?

Figure 3 for From Images to Connections: Can DQN with GNNs learn the Strategic Game of Hex?

Figure 4 for From Images to Connections: Can DQN with GNNs learn the Strategic Game of Hex?

Abstract:The gameplay of strategic board games such as chess, Go and Hex is often characterized by combinatorial, relational structures -- capturing distinct interactions and non-local patterns -- and not just images. Nonetheless, most common self-play reinforcement learning (RL) approaches simply approximate policy and value functions using convolutional neural networks (CNN). A key feature of CNNs is their relational inductive bias towards locality and translational invariance. In contrast, graph neural networks (GNN) can encode more complicated and distinct relational structures. Hence, we investigate the crucial question: Can GNNs, with their ability to encode complex connections, replace CNNs in self-play reinforcement learning? To this end, we do a comparison with Hex -- an abstract yet strategically rich board game -- serving as our experimental platform. Our findings reveal that GNNs excel at dealing with long range dependency situations in game states and are less prone to overfitting, but also showing a reduced proficiency in discerning local patterns. This suggests a potential paradigm shift, signaling the use of game-specific structures to reshape self-play reinforcement learning.

Via

Access Paper or Ask Questions

BERT-Defense: A Probabilistic Model Based on BERT to Combat Cognitively Inspired Orthographic Adversarial Attacks

Jun 02, 2021

Yannik Keller, Jan Mackensen, Steffen Eger

Figure 1 for BERT-Defense: A Probabilistic Model Based on BERT to Combat Cognitively Inspired Orthographic Adversarial Attacks

Figure 2 for BERT-Defense: A Probabilistic Model Based on BERT to Combat Cognitively Inspired Orthographic Adversarial Attacks

Figure 3 for BERT-Defense: A Probabilistic Model Based on BERT to Combat Cognitively Inspired Orthographic Adversarial Attacks

Figure 4 for BERT-Defense: A Probabilistic Model Based on BERT to Combat Cognitively Inspired Orthographic Adversarial Attacks

Abstract:Adversarial attacks expose important blind spots of deep learning systems. While word- and sentence-level attack scenarios mostly deal with finding semantic paraphrases of the input that fool NLP models, character-level attacks typically insert typos into the input stream. It is commonly thought that these are easier to defend via spelling correction modules. In this work, we show that both a standard spellchecker and the approach of Pruthi et al. (2019), which trains to defend against insertions, deletions and swaps, perform poorly on the character-level benchmark recently proposed in Eger and Benz (2020) which includes more challenging attacks such as visual and phonetic perturbations and missing word segmentations. In contrast, we show that an untrained iterative approach which combines context-independent character-level information with context-dependent information from BERT's masked language modeling can perform on par with human crowd-workers from Amazon Mechanical Turk (AMT) supervised via 3-shot learning.

* Findings of ACL 2021

Via

Access Paper or Ask Questions

AdaptiFont: Increasing Individuals' Reading Speed with a Generative Font Model and Bayesian Optimization

Apr 21, 2021

Florian Kadner, Yannik Keller, Constantin A. Rothkopf

Figure 1 for AdaptiFont: Increasing Individuals' Reading Speed with a Generative Font Model and Bayesian Optimization

Figure 2 for AdaptiFont: Increasing Individuals' Reading Speed with a Generative Font Model and Bayesian Optimization

Figure 3 for AdaptiFont: Increasing Individuals' Reading Speed with a Generative Font Model and Bayesian Optimization

Figure 4 for AdaptiFont: Increasing Individuals' Reading Speed with a Generative Font Model and Bayesian Optimization

Abstract:Digital text has become one of the primary ways of exchanging knowledge, but text needs to be rendered to a screen to be read. We present AdaptiFont, a human-in-the-loop system that is aimed at interactively increasing readability of text displayed on a monitor. To this end, we first learn a generative font space with non-negative matrix factorization from a set of classic fonts. In this space we generate new true-type-fonts through active learning, render texts with the new font, and measure individual users' reading speed. Bayesian optimization sequentially generates new fonts on the fly to progressively increase individuals' reading speed. The results of a user study show that this adaptive font generation system finds regions in the font space corresponding to high reading speeds, that these fonts significantly increase participants' reading speed, and that the found fonts are significantly different across individual readers.

* 18 pages, 11 figures

Via

Access Paper or Ask Questions