Abstract: Feature preference in Convolutional Neural Network (CNN) image classifiers is integral to their decision-making process, and while the topic has been well studied, it is still not understood at a fundamental level. We test a range of task-relevant feature attributes (including shape, texture, and color) with varying degrees of signal and noise in highly controlled CNN image classification experiments using synthetic datasets to determine feature preferences. We find that CNNs prefer features with stronger signal strength and lower noise, irrespective of whether the feature is texture, shape, or color. This provides guidance for a predictive model of task-relevant feature preferences, demonstrates pathways for bias in machine models that can be avoided with careful controls on experimental setup, and suggests that comparisons between how humans and machines prefer task-relevant features in vision classification tasks should be revisited. Code to reproduce the experiments in this paper can be found at \url{https://github.com/mwolff31/signal_preference}.
Abstract: Machine learning-based language models have recently made significant progress, which raises the danger that they could be used to spread misinformation. To combat this potential danger, several methods have been proposed for detecting text written by these language models. This paper presents two classes of black-box attacks on these detectors: one that randomly replaces characters with homoglyphs, and another that follows a simple scheme to purposefully misspell words. The homoglyph and misspelling attacks decrease a popular neural text detector's recall on neural text from 97.44% to 0.26% and 22.68%, respectively. Results also indicate that the attacks are transferable to other neural text detectors.
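The homoglyph attack described above can be illustrated with a minimal sketch. This is not the authors' implementation; the homoglyph mapping and replacement rate below are illustrative assumptions, chosen only to show the idea of swapping Latin characters for visually identical Unicode code points.

```python
import random

# Illustrative homoglyph map: Latin letters -> visually similar
# Cyrillic code points (an assumed mapping, not the paper's).
HOMOGLYPHS = {
    "a": "\u0430",  # Cyrillic small a
    "c": "\u0441",  # Cyrillic small es
    "e": "\u0435",  # Cyrillic small ie
    "o": "\u043e",  # Cyrillic small o
    "p": "\u0440",  # Cyrillic small er
}

def homoglyph_attack(text: str, rate: float = 0.5, seed: int = 0) -> str:
    """Randomly replace a fraction of mappable characters with homoglyphs.

    The output is visually near-identical to the input but tokenizes
    differently, which is what degrades a neural text detector's recall.
    """
    rng = random.Random(seed)
    out = []
    for ch in text:
        if ch in HOMOGLYPHS and rng.random() < rate:
            out.append(HOMOGLYPHS[ch])
        else:
            out.append(ch)
    return "".join(out)
```

Because the substituted characters render the same to a human reader, the attack leaves the text's meaning intact while changing the byte sequence the detector sees.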