Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Qigong Sun

Toward Motion Robustness: A masked attention regularization framework in remote photoplethysmography

Jul 09, 2024

Pengfei Zhao, Qigong Sun, Xiaolin Tian, Yige Yang, Shuo Tao, Jie Cheng, Jiantong Chen

Figure 1 for Toward Motion Robustness: A masked attention regularization framework in remote photoplethysmography

Figure 2 for Toward Motion Robustness: A masked attention regularization framework in remote photoplethysmography

Figure 3 for Toward Motion Robustness: A masked attention regularization framework in remote photoplethysmography

Figure 4 for Toward Motion Robustness: A masked attention regularization framework in remote photoplethysmography

Abstract:There has been growing interest in facial video-based remote photoplethysmography (rPPG) measurement recently, with a focus on assessing various vital signs such as heart rate and heart rate variability. Despite previous efforts on static datasets, their approaches have been hindered by inaccurate region of interest (ROI) localization and motion issues, and have shown limited generalization in real-world scenarios. To address these challenges, we propose a novel masked attention regularization (MAR-rPPG) framework that mitigates the impact of ROI localization and complex motion artifacts. Specifically, our approach first integrates a masked attention regularization mechanism into the rPPG field to capture the visual semantic consistency of facial clips, while it also employs a masking technique to prevent the model from overfitting on inaccurate ROIs and subsequently degrading its performance. Furthermore, we propose an enhanced rPPG expert aggregation (EREA) network as the backbone to obtain rPPG signals and attention maps simultaneously. Our EREA network is capable of discriminating divergent attentions from different facial areas and retaining the consistency of spatiotemporal attention maps. For motion robustness, a simple open source detector MediaPipe for data preprocessing is sufficient for our framework due to its superior capability of rPPG signal extraction and attention regularization. Exhaustive experiments on three benchmark datasets (UBFC-rPPG, PURE, and MMPD) substantiate the superiority of our proposed method, outperforming recent state-of-the-art works by a considerable margin.

* CVPR workshop 2024 accepted

Via

Access Paper or Ask Questions

Quantized Neural Networks via {-1, +1} Encoding Decomposition and Acceleration

Jun 18, 2021

Qigong Sun, Xiufang Li, Fanhua Shang, Hongying Liu, Kang Yang, Licheng Jiao, Zhouchen Lin

Figure 1 for Quantized Neural Networks via {-1, +1} Encoding Decomposition and Acceleration

Figure 2 for Quantized Neural Networks via {-1, +1} Encoding Decomposition and Acceleration

Figure 3 for Quantized Neural Networks via {-1, +1} Encoding Decomposition and Acceleration

Figure 4 for Quantized Neural Networks via {-1, +1} Encoding Decomposition and Acceleration

Abstract:The training of deep neural networks (DNNs) always requires intensive resources for both computation and data storage. Thus, DNNs cannot be efficiently applied to mobile phones and embedded devices, which severely limits their applicability in industrial applications. To address this issue, we propose a novel encoding scheme using {-1, +1} to decompose quantized neural networks (QNNs) into multi-branch binary networks, which can be efficiently implemented by bitwise operations (i.e., xnor and bitcount) to achieve model compression, computational acceleration, and resource saving. By using our method, users can achieve different encoding precisions arbitrarily according to their requirements and hardware resources. The proposed mechanism is highly suitable for the use of FPGA and ASIC in terms of data storage and computation, which provides a feasible idea for smart chips. We validate the effectiveness of our method on large-scale image classification (e.g., ImageNet), object detection, and semantic segmentation tasks. In particular, our method with low-bit encoding can still achieve almost the same performance as its high-bit counterparts.

* arXiv admin note: substantial text overlap with arXiv:1905.13389

Via

Access Paper or Ask Questions

One Model for All Quantization: A Quantized Network Supporting Hot-Swap Bit-Width Adjustment

May 04, 2021

Qigong Sun, Xiufang Li, Yan Ren, Zhongjian Huang, Xu Liu, Licheng Jiao, Fang Liu

Figure 1 for One Model for All Quantization: A Quantized Network Supporting Hot-Swap Bit-Width Adjustment

Figure 2 for One Model for All Quantization: A Quantized Network Supporting Hot-Swap Bit-Width Adjustment

Figure 3 for One Model for All Quantization: A Quantized Network Supporting Hot-Swap Bit-Width Adjustment

Figure 4 for One Model for All Quantization: A Quantized Network Supporting Hot-Swap Bit-Width Adjustment

Abstract:As an effective technique to achieve the implementation of deep neural networks in edge devices, model quantization has been successfully applied in many practical applications. No matter the methods of quantization aware training (QAT) or post-training quantization (PTQ), they all depend on the target bit-widths. When the precision of quantization is adjusted, it is necessary to fine-tune the quantized model or minimize the quantization noise, which brings inconvenience in practical applications. In this work, we propose a method to train a model for all quantization that supports diverse bit-widths (e.g., form 8-bit to 1-bit) to satisfy the online quantization bit-width adjustment. It is hot-swappable that can provide specific quantization strategies for different candidates through multiscale quantization. We use wavelet decomposition and reconstruction to increase the diversity of weights, thus significantly improving the performance of each quantization candidate, especially at ultra-low bit-widths (e.g., 3-bit, 2-bit, and 1-bit). Experimental results on ImageNet and COCO show that our method can achieve accuracy comparable performance to dedicated models trained at the same precision.

Via

Access Paper or Ask Questions

MWQ: Multiscale Wavelet Quantized Neural Networks

Mar 09, 2021

Qigong Sun, Yan Ren, Licheng Jiao, Xiufang Li, Fanhua Shang, Fang Liu

Figure 1 for MWQ: Multiscale Wavelet Quantized Neural Networks

Figure 2 for MWQ: Multiscale Wavelet Quantized Neural Networks

Figure 3 for MWQ: Multiscale Wavelet Quantized Neural Networks

Figure 4 for MWQ: Multiscale Wavelet Quantized Neural Networks

Abstract:Model quantization can reduce the model size and computational latency, it has become an essential technique for the deployment of deep neural networks on resourceconstrained hardware (e.g., mobile phones and embedded devices). The existing quantization methods mainly consider the numerical elements of the weights and activation values, ignoring the relationship between elements. The decline of representation ability and information loss usually lead to the performance degradation. Inspired by the characteristics of images in the frequency domain, we propose a novel multiscale wavelet quantization (MWQ) method. This method decomposes original data into multiscale frequency components by wavelet transform, and then quantizes the components of different scales, respectively. It exploits the multiscale frequency and spatial information to alleviate the information loss caused by quantization in the spatial domain. Because of the flexibility of MWQ, we demonstrate three applications (e.g., model compression, quantized network optimization, and information enhancement) on the ImageNet and COCO datasets. Experimental results show that our method has stronger representation ability and can play an effective role in quantized neural networks.

Via

Access Paper or Ask Questions

Effective and Fast: A Novel Sequential Single Path Search for Mixed-Precision Quantization

Mar 04, 2021

Qigong Sun, Licheng Jiao, Yan Ren, Xiufang Li, Fanhua Shang, Fang Liu

Figure 1 for Effective and Fast: A Novel Sequential Single Path Search for Mixed-Precision Quantization

Figure 2 for Effective and Fast: A Novel Sequential Single Path Search for Mixed-Precision Quantization

Figure 3 for Effective and Fast: A Novel Sequential Single Path Search for Mixed-Precision Quantization

Figure 4 for Effective and Fast: A Novel Sequential Single Path Search for Mixed-Precision Quantization

Abstract:Since model quantization helps to reduce the model size and computation latency, it has been successfully applied in many applications of mobile phones, embedded devices and smart chips. The mixed-precision quantization model can match different quantization bit-precisions according to the sensitivity of different layers to achieve great performance. However, it is a difficult problem to quickly determine the quantization bit-precision of each layer in deep neural networks according to some constraints (e.g., hardware resources, energy consumption, model size and computation latency). To address this issue, we propose a novel sequential single path search (SSPS) method for mixed-precision quantization,in which the given constraints are introduced into its loss function to guide searching process. A single path search cell is used to combine a fully differentiable supernet, which can be optimized by gradient-based algorithms. Moreover, we sequentially determine the candidate precisions according to the selection certainties to exponentially reduce the search space and speed up the convergence of searching process. Experiments show that our method can efficiently search the mixed-precision models for different architectures (e.g., ResNet-20, 18, 34, 50 and MobileNet-V2) and datasets (e.g., CIFAR-10, ImageNet and COCO) under given constraints, and our experimental results verify that SSPS significantly outperforms their uniform counterparts.

Via

Access Paper or Ask Questions

signADAM: Learning Confidences for Deep Neural Networks

Jul 21, 2019

Dong Wang, Yicheng Liu, Wenwo Tang, Fanhua Shang, Hongying Liu, Qigong Sun, Licheng Jiao

Figure 1 for signADAM: Learning Confidences for Deep Neural Networks

Figure 2 for signADAM: Learning Confidences for Deep Neural Networks

Figure 3 for signADAM: Learning Confidences for Deep Neural Networks

Figure 4 for signADAM: Learning Confidences for Deep Neural Networks

Abstract:In this paper, we propose a new first-order gradient-based algorithm to train deep neural networks. We first introduce the sign operation of stochastic gradients (as in sign-based methods, e.g., SIGN-SGD) into ADAM, which is called as signADAM. Moreover, in order to make the rate of fitting each feature closer, we define a confidence function to distinguish different components of gradients and apply it to our algorithm. It can generate more sparse gradients than existing algorithms do. We call this new algorithm signADAM++. In particular, both our algorithms are easy to implement and can speed up training of various deep neural networks. The motivation of signADAM++ is preferably learning features from the most different samples by updating large and useful gradients regardless of useless information in stochastic gradients. We also establish theoretical convergence guarantees for our algorithms. Empirical results on various datasets and models show that our algorithms yield much better performance than many state-of-the-art algorithms including SIGN-SGD, SIGNUM and ADAM. We also analyze the performance from multiple perspectives including the loss landscape and develop an adaptive method to further improve generalization. The source code is available at https://github.com/DongWanginxdu/signADAM-Learn-by-Confidence.

* 11 pages, 7 figures

Via

Access Paper or Ask Questions

Pixel DAG-Recurrent Neural Network for Spectral-Spatial Hyperspectral Image Classification

Jun 09, 2019

Xiufang Li, Qigong Sun, Lingling Li, Zhongle Ren, Fang Liu, Licheng Jiao

Figure 1 for Pixel DAG-Recurrent Neural Network for Spectral-Spatial Hyperspectral Image Classification

Figure 2 for Pixel DAG-Recurrent Neural Network for Spectral-Spatial Hyperspectral Image Classification

Figure 3 for Pixel DAG-Recurrent Neural Network for Spectral-Spatial Hyperspectral Image Classification

Figure 4 for Pixel DAG-Recurrent Neural Network for Spectral-Spatial Hyperspectral Image Classification

Abstract:Exploiting rich spatial and spectral features contributes to improve the classification accuracy of hyperspectral images (HSIs). In this paper, based on the mechanism of the population receptive field (pRF) in human visual cortex, we further utilize the spatial correlation of pixels in images and propose pixel directed acyclic graph recurrent neural network (Pixel DAG-RNN) to extract and apply spectral-spatial features for HSIs classification. In our model, an undirected cyclic graph (UCG) is used to represent the relevance connectivity of pixels in an image patch, and four DAGs are used to approximate the spatial relationship of UCGs. In order to avoid overfitting, weight sharing and dropout are adopted. The higher classification performance of our model on HSIs classification has been verified by experiments on three benchmark data sets.

Via

Access Paper or Ask Questions

Semi-supervised Complex-valued GAN for Polarimetric SAR Image Classification

Jun 09, 2019

Qigong Sun, Xiufang Li, Lingling Li, Xu Liu, Fang Liu, Licheng Jiao

Figure 1 for Semi-supervised Complex-valued GAN for Polarimetric SAR Image Classification

Figure 2 for Semi-supervised Complex-valued GAN for Polarimetric SAR Image Classification

Figure 3 for Semi-supervised Complex-valued GAN for Polarimetric SAR Image Classification

Figure 4 for Semi-supervised Complex-valued GAN for Polarimetric SAR Image Classification

Abstract:Polarimetric synthetic aperture radar (PolSAR) images are widely used in disaster detection and military reconnaissance and so on. However, their interpretation faces some challenges, e.g., deficiency of labeled data, inadequate utilization of data information and so on. In this paper, a complex-valued generative adversarial network (GAN) is proposed for the first time to address these issues. The complex number form of model complies with the physical mechanism of PolSAR data and in favor of utilizing and retaining amplitude and phase information of PolSAR data. GAN architecture and semi-supervised learning are combined to handle deficiency of labeled data. GAN expands training data and semi-supervised learning is used to train network with generated, labeled and unlabeled data. Experimental results on two benchmark data sets show that our model outperforms existing state-of-the-art models, especially for conditions with fewer labeled data.

Via

Access Paper or Ask Questions

Multi-Precision Quantized Neural Networks via Encoding Decomposition of -1 and +1

May 31, 2019

Qigong Sun, Fanhua Shang, Kang Yang, Xiufang Li, Yan Ren, Licheng Jiao

Figure 1 for Multi-Precision Quantized Neural Networks via Encoding Decomposition of -1 and +1

Figure 2 for Multi-Precision Quantized Neural Networks via Encoding Decomposition of -1 and +1

Figure 3 for Multi-Precision Quantized Neural Networks via Encoding Decomposition of -1 and +1

Figure 4 for Multi-Precision Quantized Neural Networks via Encoding Decomposition of -1 and +1

Abstract:The training of deep neural networks (DNNs) requires intensive resources both for computation and for storage performance. Thus, DNNs cannot be efficiently applied to mobile phones and embedded devices, which seriously limits their applicability in industry applications. To address this issue, we propose a novel encoding scheme of using {-1,+1} to decompose quantized neural networks (QNNs) into multi-branch binary networks, which can be efficiently implemented by bitwise operations (xnor and bitcount) to achieve model compression, computational acceleration and resource saving. Based on our method, users can easily achieve different encoding precisions arbitrarily according to their requirements and hardware resources. The proposed mechanism is very suitable for the use of FPGA and ASIC in terms of data storage and computation, which provides a feasible idea for smart chips. We validate the effectiveness of our method on both large-scale image classification tasks (e.g., ImageNet) and object detection tasks. In particular, our method with low-bit encoding can still achieve almost the same performance as its full-precision counterparts.

* 9 pages, 2 figures, Proc. 33rd AAAI Conf. Artif. Intell., 2019

Via

Access Paper or Ask Questions

Polarimetric Convolutional Network for PolSAR Image Classification

Jul 09, 2018

Xu Liu, Licheng Jiao, Xu Tang, Qigong Sun, Dan Zhang

Figure 1 for Polarimetric Convolutional Network for PolSAR Image Classification

Figure 2 for Polarimetric Convolutional Network for PolSAR Image Classification

Figure 3 for Polarimetric Convolutional Network for PolSAR Image Classification

Figure 4 for Polarimetric Convolutional Network for PolSAR Image Classification

Abstract:The approaches for analyzing the polarimetric scattering matrix of polarimetric synthetic aperture radar (PolSAR) data have always been the focus of PolSAR image classification. Generally, the polarization coherent matrix and the covariance matrix obtained by the polarimetric scattering matrix only show a limited number of polarimetric information. In order to solve this problem, we propose a sparse scattering coding way to deal with polarimetric scattering matrix and obtain a close complete feature. This encoding mode can also maintain polarimetric information of scattering matrix completely. At the same time, in view of this encoding way, we design a corresponding classification algorithm based on convolution network to combine this feature. Based on sparse scattering coding and convolution neural network, the polarimetric convolutional network is proposed to classify PolSAR images by making full use of polarimetric information. We perform the experiments on the PolSAR images acquired by AIRSAR and RADARSAT-2 to verify the proposed method. The experimental results demonstrate that the proposed method get better results and has huge potential for PolSAR data classification. Source code for sparse scattering coding is available at https://github.com/liuxuvip/Polarimetric-Scattering-Coding.

* 13 pages

Via

Access Paper or Ask Questions