Abstract: Adversarial pruning compresses models while preserving robustness. Current methods require access to adversarial examples during pruning, which significantly hampers training efficiency. Moreover, as new adversarial attacks and training methods develop at a rapid rate, adversarial pruning methods need to be modified accordingly to keep up. In this work, we propose a novel framework to prune a previously trained robust neural network while maintaining adversarial robustness, without generating further adversarial examples. We leverage concurrent self-distillation and pruning to preserve knowledge in the original model, while regularizing the pruned model via the Hilbert-Schmidt Information Bottleneck. We comprehensively evaluate our proposed framework and show its superior performance in terms of both adversarial robustness and efficiency when pruning architectures trained on the MNIST, CIFAR-10, and CIFAR-100 datasets against five state-of-the-art attacks. Code is available at https://github.com/neu-spiral/PwoA/.
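For concreteness, below is a minimal sketch of the kind of objective this abstract describes, assuming a PyTorch setup: the pruned (student) model is distilled from the original robust (teacher) model's soft outputs, with an HSIC-bottleneck penalty supplied separately, and no adversarial examples are generated. The names pwoa_loss, alpha, temperature, and lam are illustrative, not the paper's exact notation.

```python
import torch.nn.functional as F

def pwoa_loss(student_logits, teacher_logits, labels, hsic_penalty,
              alpha=0.9, temperature=4.0, lam=1e-3):
    """Hedged sketch: cross-entropy + self-distillation from the original
    robust model + an HSIC-bottleneck penalty (computed elsewhere)."""
    ce = F.cross_entropy(student_logits, labels)
    kd = F.kl_div(
        F.log_softmax(student_logits / temperature, dim=-1),
        F.softmax(teacher_logits / temperature, dim=-1),
        reduction="batchmean",
    ) * temperature ** 2
    return (1 - alpha) * ce + alpha * kd + lam * hsic_penalty
```

In practice this loss would be minimized alongside whatever sparsity-inducing pruning scheme is used, so that robustness knowledge is transferred while weights are removed.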
Abstract: Existing work in continual learning (CL) focuses on mitigating catastrophic forgetting, i.e., the deterioration of model performance on past tasks when learning a new task. However, the training efficiency of a CL system is under-investigated, which limits the real-world application of CL systems in resource-limited scenarios. In this work, we propose a novel framework called Sparse Continual Learning (SparCL), the first study to leverage sparsity for cost-effective continual learning on edge devices. SparCL achieves both training acceleration and accuracy preservation through the synergy of three aspects: weight sparsity, data efficiency, and gradient sparsity. Specifically, we propose task-aware dynamic masking (TDM) to learn a sparse network throughout the entire CL process, dynamic data removal (DDR) to remove less informative training data, and dynamic gradient masking (DGM) to sparsify the gradient updates. Each of them not only improves efficiency, but also further mitigates catastrophic forgetting. SparCL consistently improves the training efficiency of existing state-of-the-art (SOTA) CL methods, requiring up to 23x fewer training FLOPs, and, surprisingly, further improves SOTA accuracy by up to 1.7%. SparCL also outperforms competitive baselines obtained by adapting SOTA sparse training methods to the CL setting in both efficiency and accuracy. We also evaluate the effectiveness of SparCL on a real mobile phone, further indicating the practical potential of our method.
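As a rough illustration of the gradient-sparsification idea (DGM), the sketch below zeroes all but the largest-magnitude gradient entries of each parameter tensor after the backward pass. The keep_ratio parameter and the pure magnitude criterion are assumptions made for illustration; the method in the paper ties its gradient mask to the evolving weight mask rather than to raw magnitudes alone.

```python
import torch

@torch.no_grad()
def mask_gradients(model, keep_ratio=0.2):
    """Hedged sketch of gradient masking: keep only the top `keep_ratio`
    fraction of gradient entries (by magnitude) in each parameter tensor."""
    for p in model.parameters():
        if p.grad is None:
            continue
        g = p.grad.abs().flatten()
        k = max(1, int(keep_ratio * g.numel()))
        threshold = torch.topk(g, k, largest=True).values.min()
        p.grad.mul_((p.grad.abs() >= threshold).to(p.grad.dtype))
```

A function like this would be called between loss.backward() and optimizer.step(), so that only the retained gradient entries contribute to the update.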
Abstract: Beam selection for millimeter-wave links in a vehicular scenario is a challenging problem, as an exhaustive search among all candidate beam pairs cannot be assuredly completed within short contact times. We address this problem by expediting beam selection with multimodal data collected from sensors such as LiDAR, cameras, and GPS. We propose individual-modality and distributed fusion-based deep learning (F-DL) architectures that can execute locally as well as at a mobile edge computing (MEC) center, and study the associated tradeoffs. We also formulate and solve an optimization problem that accounts for practical beam-searching, MEC processing, and sensor-to-MEC data delivery latency overheads in determining the output dimensions of the above F-DL architectures. Results from extensive evaluations conducted on publicly available synthetic and home-grown real-world datasets reveal 95% and 96% improvements, respectively, in beam selection speed over classical RF-only beam sweeping. F-DL also outperforms state-of-the-art techniques by 20-22% in predicting the top-10 best beam pairs.
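A minimal sketch of a late-fusion architecture of the kind described, assuming flattened per-modality feature vectors: the class name BeamFusionNet, the layer sizes, and the choice of concatenation fusion are illustrative, and the number of output beam pairs (whose top-k predictions are then swept over the air) would be set by the latency-aware optimization the abstract mentions.

```python
import torch
import torch.nn as nn

class BeamFusionNet(nn.Module):
    """Hedged sketch: per-modality encoders whose embeddings are fused by
    concatenation to score candidate mmWave beam pairs."""
    def __init__(self, lidar_dim=1024, image_dim=2048, gps_dim=2, n_beam_pairs=256):
        super().__init__()
        self.lidar_enc = nn.Sequential(nn.Linear(lidar_dim, 128), nn.ReLU())
        self.image_enc = nn.Sequential(nn.Linear(image_dim, 128), nn.ReLU())
        self.gps_enc = nn.Sequential(nn.Linear(gps_dim, 32), nn.ReLU())
        self.head = nn.Linear(128 + 128 + 32, n_beam_pairs)

    def forward(self, lidar, image, gps):
        z = torch.cat([self.lidar_enc(lidar),
                       self.image_enc(image),
                       self.gps_enc(gps)], dim=-1)
        return self.head(z)  # top-k logits define the reduced beam-sweep set
```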
Abstract: We investigate the HSIC (Hilbert-Schmidt independence criterion) bottleneck as a regularizer for learning an adversarially robust deep neural network classifier. We show that the HSIC bottleneck enhances robustness to adversarial attacks both theoretically and experimentally. Our experiments on multiple benchmark datasets and architectures demonstrate that incorporating an HSIC bottleneck regularizer attains competitive natural accuracy and improves adversarial robustness, both with and without adversarial examples during training.
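For concreteness, here is a minimal sketch of the biased empirical HSIC estimator with Gaussian kernels on which such a regularizer builds; the bandwidth sigma and the weights lambda_x, lambda_y in the closing comment are illustrative hyperparameters.

```python
import torch

def rbf_kernel(x, sigma=5.0):
    # Gaussian (RBF) kernel matrix on flattened features
    x = x.flatten(1)
    sq_dists = torch.cdist(x, x).pow(2)
    return torch.exp(-sq_dists / (2 * sigma ** 2))

def hsic(x, y, sigma=5.0):
    """Biased empirical estimator: HSIC(X, Y) ~= tr(K H L H) / (n - 1)^2."""
    n = x.shape[0]
    K, L = rbf_kernel(x, sigma), rbf_kernel(y, sigma)
    H = torch.eye(n, device=x.device) - torch.full((n, n), 1.0 / n, device=x.device)
    return torch.trace(K @ H @ L @ H) / (n - 1) ** 2

# HSIC-bottleneck penalty on a hidden representation z, inputs x, one-hot labels y:
# penalty = lambda_x * hsic(x, z) - lambda_y * hsic(y, z)
# i.e., suppress dependence on the input while retaining dependence on the label.
```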
Abstract: In lifelong learning, we wish to maintain and update a model (e.g., a neural network classifier) in the presence of new classification tasks that arrive sequentially. In this paper, we propose a learn-prune-share (LPS) algorithm which addresses the challenges of catastrophic forgetting, parsimony, and knowledge reuse simultaneously. LPS splits the network into task-specific partitions via an ADMM-based pruning strategy, which leads to no forgetting while maintaining parsimony. Moreover, LPS integrates a novel selective knowledge-sharing scheme into this ADMM optimization framework, enabling adaptive knowledge sharing in an end-to-end fashion. Comprehensive experimental results on two lifelong learning benchmark datasets and a challenging real-world radio frequency fingerprinting dataset demonstrate the effectiveness of our approach. Our experiments show that LPS consistently outperforms multiple state-of-the-art competitors.
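A rough sketch of the task-partitioning idea, assuming per-layer binary masks produced by the ADMM pruning step: each task owns a disjoint slice of the weights, frozen once learned, and a learnable gate selectively reuses earlier tasks' slices. MaskedLinear, share_logits, and the sigmoid gating are illustrative constructs, not the paper's exact formulation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MaskedLinear(nn.Module):
    """Hedged sketch: a linear layer partitioned into task-specific slices by
    disjoint binary masks, with a learnable gate for selective knowledge sharing."""
    def __init__(self, in_features, out_features, n_tasks):
        super().__init__()
        self.weight = nn.Parameter(0.01 * torch.randn(out_features, in_features))
        # One disjoint binary mask per task (assumed to come from ADMM pruning).
        self.register_buffer("masks", torch.zeros(n_tasks, out_features, in_features))
        # Logits gating how much each earlier task's partition is reused.
        self.share_logits = nn.Parameter(torch.zeros(n_tasks, n_tasks))

    def forward(self, x, task_id):
        effective = self.masks[task_id] * self.weight
        gates = torch.sigmoid(self.share_logits[task_id])
        for t in range(task_id):  # reuse frozen partitions of earlier tasks
            effective = effective + gates[t] * self.masks[t] * self.weight.detach()
        return F.linear(x, effective)
```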