Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Zhuwei Qin

Fed2: Feature-Aligned Federated Learning

Nov 28, 2021

Fuxun Yu, Weishan Zhang, Zhuwei Qin, Zirui Xu, Di Wang, Chenchen Liu, Zhi Tian, Xiang Chen

Figure 1 for Fed2: Feature-Aligned Federated Learning

Figure 2 for Fed2: Feature-Aligned Federated Learning

Figure 3 for Fed2: Feature-Aligned Federated Learning

Figure 4 for Fed2: Feature-Aligned Federated Learning

Abstract:Federated learning learns from scattered data by fusing collaborative models from local nodes. However, the conventional coordinate-based model averaging by FedAvg ignored the random information encoded per parameter and may suffer from structural feature misalignment. In this work, we propose Fed2, a feature-aligned federated learning framework to resolve this issue by establishing a firm structure-feature alignment across the collaborative models. Fed2 is composed of two major designs: First, we design a feature-oriented model structure adaptation method to ensure explicit feature allocation in different neural network structures. Applying the structure adaptation to collaborative models, matchable structures with similar feature information can be initialized at the very early training stage. During the federated learning process, we then propose a feature paired averaging scheme to guarantee aligned feature distribution and maintain no feature fusion conflicts under either IID or non-IID scenarios. Eventually, Fed2 could effectively enhance the federated learning convergence performance under extensive homo- and heterogeneous settings, providing excellent convergence speed, accuracy, and computation/communication efficiency.

* Accepted in KDD 2021

Via

Access Paper or Ask Questions

Heterogeneous Federated Learning

Aug 15, 2020

Fuxun Yu, Weishan Zhang, Zhuwei Qin, Zirui Xu, Di Wang, Chenchen Liu, Zhi Tian, Xiang Chen

Figure 1 for Heterogeneous Federated Learning

Figure 2 for Heterogeneous Federated Learning

Figure 3 for Heterogeneous Federated Learning

Figure 4 for Heterogeneous Federated Learning

Abstract:Federated learning learns from scattered data by fusing collaborative models from local nodes. However, due to chaotic information distribution, the model fusion may suffer from structural misalignment with regard to unmatched parameters. In this work, we propose a novel federated learning framework to resolve this issue by establishing a firm structure-information alignment across collaborative models. Specifically, we design a feature-oriented regulation method ({$\Psi$-Net}) to ensure explicit feature information allocation in different neural network structures. Applying this regulating method to collaborative models, matchable structures with similar feature information can be initialized at the very early training stage. During the federated learning process under either IID or non-IID scenarios, dedicated collaboration schemes further guarantee ordered information distribution with definite structure matching, so as the comprehensive model alignment. Eventually, this framework effectively enhances the federated learning applicability to extensive heterogeneous settings, while providing excellent convergence speed, accuracy, and computation/communication efficiency.

Via

Access Paper or Ask Questions

Interpreting and Evaluating Neural Network Robustness

May 10, 2019

Fuxun Yu, Zhuwei Qin, Chenchen Liu, Liang Zhao, Yanzhi Wang, Xiang Chen

Figure 1 for Interpreting and Evaluating Neural Network Robustness

Figure 2 for Interpreting and Evaluating Neural Network Robustness

Figure 3 for Interpreting and Evaluating Neural Network Robustness

Figure 4 for Interpreting and Evaluating Neural Network Robustness

Abstract:Recently, adversarial deception becomes one of the most considerable threats to deep neural networks. However, compared to extensive research in new designs of various adversarial attacks and defenses, the neural networks' intrinsic robustness property is still lack of thorough investigation. This work aims to qualitatively interpret the adversarial attack and defense mechanism through loss visualization, and establish a quantitative metric to evaluate the neural network model's intrinsic robustness. The proposed robustness metric identifies the upper bound of a model's prediction divergence in the given domain and thus indicates whether the model can maintain a stable prediction. With extensive experiments, our metric demonstrates several advantages over conventional adversarial testing accuracy based robustness estimation: (1) it provides a uniformed evaluation to models with different structures and parameter scales; (2) it over-performs conventional accuracy based robustness estimation and provides a more reliable evaluation that is invariant to different test settings; (3) it can be fast generated without considerable testing cost.

* Accepted in IJCAI'19

Via

Access Paper or Ask Questions

Interpretable Convolutional Filter Pruning

Oct 12, 2018

Zhuwei Qin, Fuxun Yu, Chenchen Liu, Liang Zhao, Xiang Chen

Figure 1 for Interpretable Convolutional Filter Pruning

Figure 2 for Interpretable Convolutional Filter Pruning

Figure 3 for Interpretable Convolutional Filter Pruning

Figure 4 for Interpretable Convolutional Filter Pruning

Abstract:The sophisticated structure of Convolutional Neural Network (CNN) allows for outstanding performance, but at the cost of intensive computation. As significant redundancies inevitably present in such a structure, many works have been proposed to prune the convolutional filters for computation cost reduction. Although extremely effective, most works are based only on quantitative characteristics of the convolutional filters, and highly overlook the qualitative interpretation of individual filter's specific functionality. In this work, we interpreted the functionality and redundancy of the convolutional filters from different perspectives, and proposed a functionality-oriented filter pruning method. With extensive experiment results, we proved the convolutional filters' qualitative significance regardless of magnitude, demonstrated significant neural network redundancy due to repetitive filter functions, and analyzed the filter functionality defection under inappropriate retraining process. Such an interpretable pruning approach not only offers outstanding computation cost optimization over previous filter pruning methods, but also interprets filter pruning process.

Via

Access Paper or Ask Questions

How convolutional neural network see the world - A survey of convolutional neural network visualization methods

May 31, 2018

Zhuwei Qin, Fuxun Yu, Chenchen Liu, Xiang Chen

Figure 1 for How convolutional neural network see the world - A survey of convolutional neural network visualization methods

Figure 2 for How convolutional neural network see the world - A survey of convolutional neural network visualization methods

Figure 3 for How convolutional neural network see the world - A survey of convolutional neural network visualization methods

Figure 4 for How convolutional neural network see the world - A survey of convolutional neural network visualization methods

Abstract:Nowadays, the Convolutional Neural Networks (CNNs) have achieved impressive performance on many computer vision related tasks, such as object detection, image recognition, image retrieval, etc. These achievements benefit from the CNNs outstanding capability to learn the input features with deep layers of neuron structures and iterative training process. However, these learned features are hard to identify and interpret from a human vision perspective, causing a lack of understanding of the CNNs internal working mechanism. To improve the CNN interpretability, the CNN visualization is well utilized as a qualitative analysis method, which translates the internal features into visually perceptible patterns. And many CNN visualization works have been proposed in the literature to interpret the CNN in perspectives of network structure, operation, and semantic concept. In this paper, we expect to provide a comprehensive survey of several representative CNN visualization methods, including Activation Maximization, Network Inversion, Deconvolutional Neural Networks (DeconvNet), and Network Dissection based visualization. These methods are presented in terms of motivations, algorithms, and experiment results. Based on these visualization methods, we also discuss their practical applications to demonstrate the significance of the CNN interpretability in areas of network design, optimization, security enhancement, etc.

* 32 pages, 21 figures. Mathematical Foundations of Computing

Via

Access Paper or Ask Questions