Jiaxuan Wang

Predictive Response Optimization: Using Reinforcement Learning to Fight Online Social Network Abuse

Feb 24, 2025

Garrett Wilson, Geoffrey Goh, Yan Jiang, Ajay Gupta, Jiaxuan Wang, David Freeman, Francesco Dinuzzo

Abstract:Detecting phishing, spam, fake accounts, data scraping, and other malicious activity in online social networks (OSNs) is a problem that has been studied for well over a decade, with a number of important results. Nearly all existing works on abuse detection have as their goal producing the best possible binary classifier; i.e., one that labels unseen examples as "benign" or "malicious" with high precision and recall. However, no prior published work considers what comes next: what does the service actually do after it detects abuse? In this paper, we argue that detection as described in previous work is not the goal of those who are fighting OSN abuse. Rather, we believe the goal to be selecting actions (e.g., ban the user, block the request, show a CAPTCHA, or "collect more evidence") that optimize a tradeoff between harm caused by abuse and impact on benign users. With this framing, we see that enlarging the set of possible actions allows us to move the Pareto frontier in a way that is unattainable by simply tuning the threshold of a binary classifier. To demonstrate the potential of our approach, we present Predictive Response Optimization (PRO), a system based on reinforcement learning that utilizes available contextual information to predict future abuse and user-experience metrics conditioned on each possible action, and select actions that optimize a multi-dimensional tradeoff between abuse/harm and impact on user experience. We deployed versions of PRO targeted at stopping automated activity on Instagram and Facebook. In both cases our experiments showed that PRO outperforms a baseline classification system, reducing abuse volume by 59% and 4.5% (respectively) with no negative impact to users. We also present several case studies that demonstrate how PRO can quickly and automatically adapt to changes in business constraints, system behavior, and/or adversarial tactics.

* To appear in USENIX Security 2025
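The abstract frames abuse response as choosing, among several candidate actions, the one that best trades off predicted abuse against predicted impact on benign users. The following is a minimal sketch of that selection step only, not the PRO system itself. It assumes the per-action predictions are already available and that the tradeoff is collapsed into a single weighted sum; the names `Prediction`, `select_action`, and `ux_weight`, and all numbers, are hypothetical and do not come from the paper.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Prediction:
    """Hypothetical per-action forecast; field names are illustrative only."""
    action: str
    abuse: float    # predicted future abuse if this action is taken
    ux_cost: float  # predicted negative impact on user experience


def select_action(predictions, ux_weight):
    """Pick the action with the lowest weighted cost: abuse + ux_weight * ux_cost.

    A weighted sum is one simple way to scalarize a two-metric tradeoff;
    it is an assumption made for this sketch.
    """
    if not predictions:
        raise ValueError("need at least one candidate action")
    best = min(predictions, key=lambda p: p.abuse + ux_weight * p.ux_cost)
    return best.action


# Made-up numbers for three of the action types named in the abstract.
candidates = [
    Prediction("no_action", abuse=10.0, ux_cost=0.0),
    Prediction("show_captcha", abuse=4.0, ux_cost=1.0),
    Prediction("block_request", abuse=1.0, ux_cost=5.0),
]

for weight in (0.5, 1.0, 10.0):
    print(weight, select_action(candidates, weight))
```

Varying `ux_weight` changes which action wins, which illustrates why a larger action set offers tradeoffs that a single classifier threshold cannot reach.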

Structure PLP-SLAM: Efficient Sparse Mapping and Localization using Point, Line and Plane for Monocular, RGB-D and Stereo Cameras

Jul 19, 2022

Fangwen Shu, Jiaxuan Wang, Alain Pagani, Didier Stricker

Abstract: This paper demonstrates a visual SLAM system that uses point and line clouds for robust camera localization, simultaneously with an embedded piece-wise planar reconstruction (PPR) module, which altogether provides a structural map. Building a scale-consistent map in parallel with tracking, for example with a single camera, brings the challenge of reconstructing geometric primitives with scale ambiguity and further complicates the graph optimization in bundle adjustment (BA). We address these problems by proposing several run-time optimizations on the reconstructed lines and planes. The system is then extended with depth and stereo sensors based on the design of the monocular framework. The results show that our proposed SLAM tightly incorporates the semantic features to boost both frontend tracking and backend optimization. We evaluate our system exhaustively on various datasets, and open-source our code for the community (https://github.com/PeterFWS/Structure-PLP-SLAM).

* Pre-print version; v2 adds supplementary materials. Code is open source: https://github.com/PeterFWS/Structure-PLP-SLAM

Shapley Flow: A Graph-based Approach to Interpreting Model Predictions

Nov 13, 2020

Jiaxuan Wang, Jenna Wiens, Scott Lundberg

Abstract: Many existing approaches for estimating feature importance are problematic because they ignore or hide dependencies among features. A causal graph, which encodes the relationships among input variables, can aid in assigning feature importance. However, current approaches that assign credit to nodes in the causal graph fail to explain the entire graph. In light of these limitations, we propose Shapley Flow, a novel approach to interpreting machine learning models. It considers the entire causal graph, and assigns credit to edges instead of treating nodes as the fundamental unit of credit assignment. Shapley Flow is the unique solution to a generalization of the Shapley value axioms to directed acyclic graphs. We demonstrate the benefit of using Shapley Flow to reason about the impact of a model's input on its output. In addition to maintaining insights from existing approaches, Shapley Flow extends the flat, set-based view prevalent in game-theory-based explanation methods to a deeper, graph-based view. This graph-based view enables users to understand the flow of importance through a system, and reason about potential interventions.

* Corrected a typo for the definition of Boundary consistency on page 5
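For context on the "flat, set-based view" that the abstract contrasts with, here is a small sketch of the classical Shapley value, which assigns credit to players (features) rather than to edges. This is not Shapley Flow; it is only the standard baseline the paper generalizes. The function names and the toy value function are hypothetical and chosen for illustration.

```python
from itertools import permutations


def shapley_values(players, value):
    """Exact classical Shapley values.

    Averages each player's marginal contribution over every ordering,
    so the cost grows factorially and only suits very small games.
    """
    totals = {p: 0.0 for p in players}
    orderings = list(permutations(players))
    for order in orderings:
        coalition = frozenset()
        for p in order:
            with_p = coalition | {p}
            totals[p] += value(with_p) - value(coalition)
            coalition = with_p
    return {p: total / len(orderings) for p, total in totals.items()}


def toy_model(coalition):
    """Hypothetical value function: two features plus an interaction term."""
    out = 0.0
    if "a" in coalition:
        out += 1.0
    if "b" in coalition:
        out += 2.0
    if "a" in coalition and "b" in coalition:
        out += 2.0  # interaction, split evenly by the Shapley value
    return out


phi = shapley_values(["a", "b"], toy_model)
print(phi)
```

The attributions sum to the value of the full coalition minus the value of the empty one (the efficiency axiom), but they say nothing about how influence travels between dependent features, which is the gap a graph-based view is meant to address.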

AdaSGD: Bridging the gap between SGD and Adam

Jun 30, 2020

Jiaxuan Wang, Jenna Wiens

Abstract: In the context of stochastic gradient descent (SGD) and adaptive moment estimation (Adam), researchers have recently proposed optimization techniques that transition from Adam to SGD with the goal of improving both convergence and generalization performance. However, precisely how each approach trades off early progress and generalization is not well understood; thus, it is unclear when, or even if, one should transition from one approach to the other. In this work, by first studying the convex setting, we identify potential contributors to observed differences in performance between SGD and Adam. In particular, we provide theoretical insights for when and why Adam outperforms SGD and vice versa. We address the performance gap by adapting a single global learning rate for SGD, which we refer to as AdaSGD. We justify this proposed approach with empirical analyses in non-convex settings. On several datasets that span three different domains, we demonstrate how AdaSGD combines the benefits of both SGD and Adam, eliminating the need for approaches that transition from Adam to SGD.

Relaxed Weight Sharing: Effectively Modeling Time-Varying Relationships in Clinical Time-Series

Jun 07, 2019

Jeeheh Oh, Jiaxuan Wang, Shengpu Tang, Michael Sjoding, Jenna Wiens

Abstract:Recurrent neural networks (RNNs) are commonly applied to clinical time-series data with the goal of learning patient risk stratification models. Their effectiveness is due, in part, to their use of parameter sharing over time (i.e., cells are repeated hence the name recurrent). We hypothesize, however, that this trait also contributes to the increased difficulty such models have with learning relationships that change over time. Conditional shift, i.e., changes in the relationship between the input X and the output y, arises if the risk factors for the event of interest change over the course of a patient admission. While in theory, RNNs and gated RNNs (e.g., LSTMs) in particular should be capable of learning time-varying relationships, when training data are limited, such models often fail to accurately capture these dynamics. We illustrate the advantages and disadvantages of complete weight sharing (RNNs) by comparing an LSTM with shared parameters to a sequential architecture with time-varying parameters on three clinically-relevant prediction tasks: acute respiratory failure (ARF), shock, and in-hospital mortality. In experiments using synthetic data, we demonstrate how weight sharing in LSTMs leads to worse performance in the presence of conditional shift. To improve upon the dichotomy between complete weight sharing vs. no weight sharing, we propose a novel RNN formulation based on a mixture model in which we relax weight sharing over time. The proposed method outperforms standard LSTMs and other state-of-the-art baselines across all tasks. In settings with limited data, relaxed weight sharing can lead to improved patient risk stratification performance.

Learning to Exploit Invariances in Clinical Time-Series Data using Sequence Transformer Networks

Aug 21, 2018

Jeeheh Oh, Jiaxuan Wang, Jenna Wiens

Abstract: Recently, researchers have started applying convolutional neural networks (CNNs) with one-dimensional convolutions to clinical tasks involving time-series data. This is due, in part, to their computational efficiency relative to recurrent neural networks and their ability to efficiently exploit certain temporal invariances (e.g., phase invariance). However, it is well-established that clinical data may exhibit many other types of invariances (e.g., scaling). While preprocessing techniques (e.g., dynamic time warping) may successfully transform and align inputs, their use often requires one to identify the types of invariances in advance. In contrast, we propose the use of Sequence Transformer Networks, an end-to-end trainable architecture that learns to identify and account for invariances in clinical time-series data. Applied to the task of predicting in-hospital mortality, our proposed approach achieves an improvement in the area under the receiver operating characteristic curve (AUROC) relative to a baseline CNN (AUROC=0.851 vs. AUROC=0.838). Our results suggest that a variety of valuable invariances can be learned directly from the data.

Learning Credible Models

Jun 07, 2018

Jiaxuan Wang, Jeeheh Oh, Haozhu Wang, Jenna Wiens

Abstract: In many settings, it is important that a model be capable of providing reasons for its predictions (i.e., the model must be interpretable). However, the model's reasoning may not conform with well-established knowledge. In such cases, while interpretable, the model lacks credibility. In this work, we formally define credibility in the linear setting and focus on techniques for learning models that are both accurate and credible. In particular, we propose a regularization penalty, expert yielded estimates (EYE), that incorporates expert knowledge about well-known relationships among covariates and the outcome of interest. We give both theoretical and empirical results comparing our proposed method to several other regularization techniques. Across a range of settings, experiments on both synthetic and real data show that models learned using the EYE penalty are significantly more credible than those learned using other penalties. Applied to a large-scale patient risk stratification task, our proposed technique results in a model whose top features overlap significantly with known clinical risk factors, while still achieving good predictive performance.

The Advantage of Doubling: A Deep Reinforcement Learning Approach to Studying the Double Team in the NBA

Mar 08, 2018

Jiaxuan Wang, Ian Fox, Jonathan Skaza, Nick Linck, Satinder Singh, Jenna Wiens

Abstract:During the 2017 NBA playoffs, Celtics coach Brad Stevens was faced with a difficult decision when defending against the Cavaliers: "Do you double and risk giving up easy shots, or stay at home and do the best you can?" It's a tough call, but finding a good defensive strategy that effectively incorporates doubling can make all the difference in the NBA. In this paper, we analyze double teaming in the NBA, quantifying the trade-off between risk and reward. Using player trajectory data pertaining to over 643,000 possessions, we identified when the ball handler was double teamed. Given these data and the corresponding outcome (i.e., was the defense successful), we used deep reinforcement learning to estimate the quality of the defensive actions. We present qualitative and quantitative results summarizing our learned defensive strategy for defending. We show that our policy value estimates are predictive of points per possession and win percentage. Overall, the proposed framework represents a step toward a more comprehensive understanding of defensive strategies in the NBA.

* Accepted to MIT Sloan Sports Analytics 2018. First two authors contributed equally
