Heng Qi

Structure-Aware Automatic Channel Pruning by Searching with Graph Embedding

Jun 13, 2025

Zifan Liu, Yuan Cao, Yanwei Yu, Heng Qi, Jie Gui

Abstract: Channel pruning is a powerful technique to reduce the computational overhead of deep neural networks, enabling efficient deployment on resource-constrained devices. However, existing pruning methods often rely on local heuristics or weight-based criteria that fail to capture global structural dependencies within the network, leading to suboptimal pruning decisions and degraded model performance. To address these limitations, we propose a novel structure-aware automatic channel pruning (SACP) framework that utilizes graph convolutional networks (GCNs) to model the network topology and learn the global importance of each channel. By encoding structural relationships within the network, our approach implements topology-aware pruning, and this pruning is fully automated, reducing the need for human intervention. We restrict the pruning rate combinations to a specific space, where the number of combinations can be dynamically adjusted, and use a search-based approach to determine the optimal pruning rate combinations. Extensive experiments on benchmark datasets (CIFAR-10, ImageNet) with various models (ResNet, VGG16) demonstrate that SACP outperforms state-of-the-art pruning methods on compression efficiency and is competitive on accuracy retention.

* 12 pages, 2 figures
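
The idea of searching a restricted space of pruning-rate combinations can be illustrated with a minimal sketch. This is not the SACP procedure: the exhaustive enumeration, the linear cost model, and the names `search_pruning_rates`, `layer_costs`, and `score_fn` are all assumptions made for illustration, and the GCN-based importance model is replaced by a toy scoring function.

```python
from itertools import product


def search_pruning_rates(layer_costs, candidate_rates, budget, score_fn):
    """Exhaustively search per-layer pruning rates under a cost budget.

    layer_costs: hypothetical per-layer compute cost before pruning.
    candidate_rates: the restricted set of rates each layer may take.
    budget: maximum total cost allowed after pruning.
    score_fn: hypothetical proxy for model quality (higher is better).
    """
    best_combo, best_score = None, float("-inf")
    for combo in product(candidate_rates, repeat=len(layer_costs)):
        # Assumption: cost shrinks linearly with the fraction of channels kept.
        remaining = sum(c * (1.0 - r) for c, r in zip(layer_costs, combo))
        if remaining > budget:
            continue  # combination does not meet the compression target
        score = score_fn(combo)
        if score > best_score:
            best_combo, best_score = combo, score
    return best_combo, best_score


# Toy proxy: pruning a sensitive layer costs more quality.
SENSITIVITY = (3.0, 1.0, 2.0)


def toy_score(combo):
    return -sum(s * r for s, r in zip(SENSITIVITY, combo))


best, score = search_pruning_rates(
    layer_costs=[100.0, 200.0, 50.0],
    candidate_rates=(0.0, 0.25, 0.5),
    budget=250.0,
    score_fn=toy_score,
)
print(best, score)
```

Shrinking or enlarging `candidate_rates` changes the number of combinations searched, which is the sense in which the space can be adjusted in this sketch.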

KGTrust: Evaluating Trustworthiness of SIoT via Knowledge Enhanced Graph Neural Networks

Feb 22, 2023

Zhizhi Yu, Di Jin, Cuiying Huo, Zhiqiang Wang, Xiulong Liu, Heng Qi, Jia Wu, Lingfei Wu

Abstract: Social Internet of Things (SIoT), a promising and emerging paradigm that injects the notion of social networking into smart objects (i.e., things), is paving the way for the next generation of the Internet of Things. However, due to risks and uncertainty, a crucial and urgent problem is establishing reliable relationships within SIoT, that is, trust evaluation. Graph neural networks for trust evaluation typically adopt a straightforward encoding such as one-hot or node2vec to represent node characteristics, which ignores the valuable semantic knowledge attached to nodes. Moreover, the underlying structure of SIoT is usually complex, including both the heterogeneous graph structure and pairwise trust relationships, which makes it hard to preserve the properties of SIoT trust during information propagation. To address these problems, we propose a novel knowledge-enhanced graph neural network (KGTrust) for better trust evaluation in SIoT. Specifically, we first extract useful knowledge from users' comment behaviors and external structured triples related to object descriptions, in order to gain a deeper insight into the semantics of users and objects. Furthermore, we introduce a discriminative convolutional layer that utilizes heterogeneous graph structure, node semantics, and augmented trust relationships to learn node embeddings from the perspective of a user as a trustor or a trustee, effectively capturing multi-aspect properties of SIoT trust during information propagation. Finally, a trust prediction layer is developed to estimate the trust relationships between pairwise nodes. Extensive experiments on three public datasets illustrate the superior performance of KGTrust over state-of-the-art methods.

* Accepted by WWW-23

One-shot Network Pruning at Initialization with Discriminative Image Patches

Sep 13, 2022

Yinan Yang, Ying Ji, Yu Wang, Heng Qi, Jien Kato

Abstract: One-shot Network Pruning at Initialization (OPaI) is an effective method to decrease network pruning costs. Recently, there has been a growing belief that data is unnecessary in OPaI. However, we reach the opposite conclusion through ablation experiments on two representative OPaI methods, SNIP and GraSP. Specifically, we find that informative data is crucial to enhancing pruning performance. In this paper, we propose two novel methods, Discriminative One-shot Network Pruning (DOP) and Super Stitching, to prune the network using high-level visual discriminative image patches. Our contributions are as follows. (1) Extensive experiments reveal that OPaI is data-dependent. (2) Super Stitching performs significantly better than the original OPaI method on the ImageNet benchmark, especially for highly compressed models.

NodeTrans: A Graph Transfer Learning Approach for Traffic Prediction

Jul 04, 2022

Xueyan Yin, Feifan Li, Yanming Shen, Heng Qi, Baocai Yin

Abstract: Recently, deep learning methods have made great progress in traffic prediction, but their performance depends on a large amount of historical data. In reality, we may face data scarcity, in which case deep learning models fail to obtain satisfactory performance. Transfer learning is a promising approach to the data scarcity issue. However, existing transfer learning approaches in traffic prediction are mainly based on regular grid data, which is not suitable for the inherent graph data in the traffic network. Moreover, existing graph-based models can only capture shared traffic patterns in the road network, and how to learn node-specific patterns is also a challenge. In this paper, we propose a novel transfer learning approach for traffic prediction with limited data, which can transfer the knowledge learned from a data-rich source domain to a data-scarce target domain. First, a spatial-temporal graph neural network is proposed, which can capture the node-specific spatial-temporal traffic patterns of different road networks. Then, to improve the robustness of transfer, we design a pattern-based transfer strategy, where we leverage a clustering-based mechanism to distill common spatial-temporal patterns in the source domain, and use this knowledge to further improve the prediction performance of the target domain. Experiments on real-world datasets verify the effectiveness of our approach.

Soft-mask: Adaptive Substructure Extractions for Graph Neural Networks

Jun 11, 2022

Mingqi Yang, Yanming Shen, Heng Qi, Baocai Yin

Abstract: For learning graph representations, not all detailed structures within a graph are relevant to the given graph tasks. Task-relevant structures can be localized or sparse, involving only subgraphs or characterized by the interactions of subgraphs (a hierarchical perspective). A graph neural network should be able to efficiently extract task-relevant structures and be invariant to irrelevant parts, which is challenging for general message passing GNNs. In this work, we propose to learn graph representations from a sequence of subgraphs of the original graph to better capture task-relevant substructures or hierarchical structures and skip noisy parts. To this end, we design a soft-mask GNN layer to extract desired subgraphs through the mask mechanism. The soft-mask is defined in a continuous space to maintain differentiability and characterize the weights of different parts. Compared with existing subgraph or hierarchical representation learning methods and graph pooling operations, the soft-mask GNN layer is not limited by a fixed sample or drop ratio, and is therefore more flexible in extracting subgraphs of arbitrary sizes. Extensive experiments on public graph benchmarks show that the soft-mask mechanism brings performance improvements. It also provides interpretability: visualizing the values of masks in each layer gives insight into the structures learned by the model.

* The Web Conference (WWW), 2021
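
A continuous mask that weights parts of a graph can be sketched as follows. This is a simplified illustration, not the layer from the paper: the sigmoid parameterization, the sum aggregation, and the names `soft_mask_sum` and `mask_logits` are assumptions.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def soft_mask_sum(features, neighbors, mask_logits):
    """Sum neighbor features, each scaled by a mask value in (0, 1).

    features: list of per-node feature vectors.
    neighbors: adjacency list, neighbors[v] is the list of v's neighbors.
    mask_logits: hypothetical per-node scores; in a trained model these
        would be learned, here they are given by hand.
    """
    masks = [sigmoid(z) for z in mask_logits]
    dim = len(features[0])
    out = []
    for nbrs in neighbors:
        agg = [0.0] * dim
        for u in nbrs:
            for d in range(dim):
                # A mask near 0 softly drops node u; near 1 keeps it.
                agg[d] += masks[u] * features[u][d]
        out.append(agg)
    return out


# Triangle graph; node 2 carries a large "noisy" feature and is masked out.
features = [[1.0], [2.0], [100.0]]
neighbors = [[1, 2], [0, 2], [0, 1]]
masked = soft_mask_sum(features, neighbors, mask_logits=[10.0, 10.0, -10.0])
print(masked[0])  # close to [2.0]: node 2 contributes almost nothing
```

Because the mask is continuous rather than a hard keep-or-drop decision, the output stays differentiable with respect to the mask scores, and no fixed drop ratio is imposed.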

Improving Spectral Graph Convolution for Learning Graph-level Representation

Dec 14, 2021

Mingqi Yang, Rui Li, Yanming Shen, Heng Qi, Baocai Yin

Abstract: From the original theoretically well-defined spectral graph convolution to the subsequent spatial-based message-passing model, spatial locality (in the vertex domain) acts as a fundamental principle of most graph neural networks (GNNs). In spectral graph convolution, the filter is approximated by polynomials, where a $k$-order polynomial covers $k$-hop neighbors. In message passing, the various definitions of neighbors used in aggregations are actually an extensive exploration of spatial locality information. For learning node representations, the topological distance seems necessary since it characterizes the basic relations between nodes. However, for learning representations of entire graphs, does this principle still need to hold? In this work, we show that such a principle is not necessary; it hinders most existing GNNs from efficiently encoding graph structures. By removing it, as well as the limitation of polynomial filters, the resulting new architecture significantly boosts performance on learning graph representations. We also study the effects of the graph spectrum on signals and interpret various existing improvements as different spectrum smoothing techniques. This serves as a spatial understanding that quantitatively measures the effects of the spectrum on input signals, in comparison to the well-known spectral understanding as high/low-pass filters. More importantly, it sheds light on developing powerful graph representation models.
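
The statement that a $k$-order polynomial filter covers $k$-hop neighbors can be checked on a small graph. The sketch below applies a polynomial in the combinatorial Laplacian to an impulse on a 4-node path graph; the function names and the choice of unit coefficients are illustrative assumptions, not taken from the paper.

```python
def mat_vec(matrix, vector):
    return [sum(a * b for a, b in zip(row, vector)) for row in matrix]


def laplacian(adj):
    """Combinatorial Laplacian L = D - A of an unweighted graph."""
    n = len(adj)
    return [
        [(sum(adj[i]) if i == j else 0) - adj[i][j] for j in range(n)]
        for i in range(n)
    ]


def polynomial_filter(adj, signal, thetas):
    """Compute sum_i thetas[i] * L^i * signal."""
    lap = laplacian(adj)
    term = list(signal)  # L^0 x
    out = [0.0] * len(signal)
    for theta in thetas:
        out = [o + theta * t for o, t in zip(out, term)]
        term = mat_vec(lap, term)  # next power of L applied to x
    return out


# Path graph 0 - 1 - 2 - 3 with an impulse at node 0.
path = [
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 0],
]
impulse = [1.0, 0.0, 0.0, 0.0]
order2 = polynomial_filter(path, impulse, thetas=[1.0, 1.0, 1.0])
print(order2)  # node 3 (3 hops away) stays at zero
```

With three coefficients (order 2), the response reaches nodes 0 to 2 only; adding a fourth coefficient extends it to node 3.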

Federated Unlearning via Class-Discriminative Pruning

Oct 22, 2021

Junxiao Wang, Song Guo, Xin Xie, Heng Qi

Abstract: We explore the problem of selectively forgetting categories from trained CNN classification models in federated learning (FL). Given that the data used for training cannot be accessed globally in FL, our insights probe deep into the internal influence of each channel. Through the visualization of feature maps activated by different channels, we observe that different channels have a varying contribution to different categories in image classification. Inspired by this, we propose a method for scrubbing the model clean of information about particular categories. The method does not require retraining from scratch, nor global access to the data used for training. Instead, we introduce the concept of Term Frequency Inverse Document Frequency (TF-IDF) to quantify the class discrimination of channels. Channels with high TF-IDF scores are more discriminative for the target categories and thus need to be pruned to unlearn. The channel pruning is followed by a fine-tuning process to recover the performance of the pruned model. Evaluated on the CIFAR10 dataset, our method accelerates the speed of unlearning by 8.9x for the ResNet model and 7.9x for the VGG model with no degradation in accuracy, compared to retraining from scratch. For the CIFAR100 dataset, the speedups are 9.9x and 8.4x, respectively. We envision this work as a complementary block for FL towards compliance with legal and ethical criteria.

* 15 pages
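
One way to read the TF-IDF analogy is to treat classes as documents and channels as terms. The sketch below follows that reading under stated assumptions: the activation table, the activity threshold, the smoothed IDF formula, and the function names are all hypothetical and are not the paper's exact formulation.

```python
import math


def tfidf_channel_scores(activations, target_class, active_threshold=0.5):
    """Score channels by how specific they are to one class.

    activations[c][k]: hypothetical mean activation of channel k on class c.
    TF: channel k's share of the target class's total activation.
    IDF: smoothed log-inverse of the number of classes where k is active.
    """
    num_classes = len(activations)
    row = activations[target_class]
    total = sum(row)
    scores = []
    for k in range(len(row)):
        tf = row[k] / total if total > 0 else 0.0
        df = sum(1 for c in range(num_classes)
                 if activations[c][k] > active_threshold)
        idf = math.log((1 + num_classes) / (1 + df)) + 1.0
        scores.append(tf * idf)
    return scores


def channels_to_prune(scores, num_prune):
    """Indices of the highest-scoring channels, most discriminative first."""
    ranked = sorted(range(len(scores)), key=lambda k: scores[k], reverse=True)
    return ranked[:num_prune]


# Three classes, three channels. Channel 0 fires mostly for class 0,
# channel 1 fires for every class, channel 2 is weak everywhere.
activations = [
    [0.9, 0.8, 0.1],
    [0.1, 0.9, 0.2],
    [0.0, 0.7, 0.1],
]
scores = tfidf_channel_scores(activations, target_class=0)
print(channels_to_prune(scores, num_prune=1))
```

In this toy table the class-specific channel ranks first, while the channel that is active for every class is down-weighted by its low IDF.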

Breaking the Expressive Bottlenecks of Graph Neural Networks

Dec 14, 2020

Mingqi Yang, Yanming Shen, Heng Qi, Baocai Yin

Abstract: Recently, the Weisfeiler-Lehman (WL) graph isomorphism test was used to measure the expressiveness of graph neural networks (GNNs), showing that neighborhood aggregation GNNs are at most as powerful as the 1-WL test in distinguishing graph structures. Improvements have also been proposed in analogy to the $k$-WL test ($k>1$). However, the aggregators in these GNNs are far from injective as required by the WL test and suffer from weak distinguishing strength, which makes them expressive bottlenecks. In this paper, we improve the expressiveness by exploring powerful aggregators. We reformulate aggregation with the corresponding aggregation coefficient matrix, and then systematically analyze the requirements of the aggregation coefficient matrix for building more powerful aggregators and even injective aggregators. This can also be viewed as a strategy for preserving the rank of hidden features, and implies that basic aggregators correspond to a special case of low-rank transformations. We also show the necessity of applying nonlinear units ahead of aggregation, which is different from most aggregation-based GNNs. Based on our theoretical analysis, we develop two GNN layers, ExpandingConv and CombConv. Experimental results show that our models significantly boost performance, especially for large and densely connected graphs.
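
The non-injectivity of basic aggregators is easy to see on toy neighborhoods with scalar features. The sketch below is only an illustration of that well-known observation; it does not reproduce the aggregation coefficient matrix analysis, and the helper `collisions` is a hypothetical name.

```python
def mean_agg(values):
    return sum(values) / len(values)


def sum_agg(values):
    return sum(values)


def max_agg(values):
    return max(values)


def collisions(aggregator, multisets):
    """Pairs of distinct neighbor multisets mapped to the same output."""
    pairs = []
    for i in range(len(multisets)):
        for j in range(i + 1, len(multisets)):
            if aggregator(multisets[i]) == aggregator(multisets[j]):
                pairs.append((multisets[i], multisets[j]))
    return pairs


# Three different neighborhoods, each a multiset of scalar node features.
neighborhoods = [(1.0, 1.0), (1.0, 1.0, 1.0, 1.0), (0.0, 2.0)]

for name, agg in [("mean", mean_agg), ("sum", sum_agg), ("max", max_agg)]:
    print(name, collisions(agg, neighborhoods))
```

Each of the three aggregators confuses at least one pair here, so a layer built on any of them alone cannot tell those neighborhoods apart.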

A Comprehensive Survey on Traffic Prediction

Apr 29, 2020

Xueyan Yin, Genze Wu, Jinze Wei, Yanming Shen, Heng Qi, Baocai Yin

Abstract: Traffic prediction plays an essential role in intelligent transportation systems. Accurate traffic prediction can assist route planning, guide vehicle dispatching, and mitigate traffic congestion. This problem is challenging due to the complicated and dynamic spatio-temporal dependencies between different regions in the road network. Recently, a significant amount of research effort has been devoted to this area, greatly advancing traffic prediction abilities. The purpose of this paper is to provide a comprehensive survey of traffic prediction. Specifically, we first summarize the existing traffic prediction methods and give a taxonomy of them. Second, we list the common applications of traffic prediction and the state of the art in these applications. Third, we collect and organize widely used public datasets in the existing literature. Furthermore, we give an evaluation by conducting extensive experiments to compare the performance of methods for traffic demand and speed prediction, respectively, on two datasets. Finally, we discuss potential future directions.

An efficient deep learning hashing neural network for mobile visual search

Oct 21, 2017

Heng Qi, Wu Liu, Liang Liu

Abstract: Mobile visual search applications are emerging that enable users to sense their surroundings with smart phones. However, because of the particular challenges of mobile visual search, achieving a high recognition bitrate has become a consistent target of previous related works. In this paper, we propose a few-parameter, low-latency, and high-accuracy deep hashing approach for constructing binary hash codes for mobile visual search. First, we exploit the architecture of the MobileNet model, which significantly decreases the latency of deep feature extraction by reducing the number of model parameters while maintaining accuracy. Second, we add a hash-like layer into MobileNet to train the model on labeled mobile visual data. Evaluations show that the proposed system can exceed state-of-the-art accuracy in terms of MAP. More importantly, the memory consumption is much less than that of other deep learning models. The proposed method requires only $13$ MB of memory for the neural network and achieves a MAP of $97.80\%$ on the mobile location recognition dataset used for testing.
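
How binary hash codes support retrieval can be sketched in a few lines. The example assumes the hash-like layer emits activations in [0, 1] that are thresholded at 0.5 and compared by Hamming distance; that threshold, the function names, and the toy activations are assumptions for illustration, not details confirmed by the abstract.

```python
def binarize(activations, threshold=0.5):
    """Turn real-valued layer outputs into a binary code."""
    return [1 if a >= threshold else 0 for a in activations]


def hamming(code_a, code_b):
    """Number of bit positions where two codes differ."""
    return sum(x != y for x, y in zip(code_a, code_b))


def search(query_activations, database_activations, top_k=1):
    """Rank database items by Hamming distance to the query code."""
    query_code = binarize(query_activations)
    codes = [binarize(item) for item in database_activations]
    ranked = sorted(range(len(codes)),
                    key=lambda i: hamming(query_code, codes[i]))
    return ranked[:top_k]


# Hypothetical 4-bit hash-layer outputs for three database images.
database = [
    [0.9, 0.1, 0.8, 0.2],
    [0.1, 0.9, 0.2, 0.7],
    [0.8, 0.2, 0.9, 0.6],
]
query = [0.7, 0.3, 0.6, 0.1]
print(search(query, database, top_k=2))
```

Short binary codes keep the index small and make comparison cheap, which is the usual motivation for hashing on mobile devices.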
