Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Hanxiang Hao

UPSCALE: Unconstrained Channel Pruning

Jul 17, 2023

Alvin Wan, Hanxiang Hao, Kaushik Patnaik, Yueyang Xu, Omer Hadad, David Güera, Zhile Ren, Qi Shan

Abstract:As neural networks grow in size and complexity, inference speeds decline. To combat this, one of the most effective compression techniques -- channel pruning -- removes channels from weights. However, for multi-branch segments of a model, channel removal can introduce inference-time memory copies. In turn, these copies increase inference latency -- so much so that the pruned model can be slower than the unpruned model. As a workaround, pruners conventionally constrain certain channels to be pruned together. This fully eliminates memory copies but, as we show, significantly impairs accuracy. We now have a dilemma: Remove constraints but increase latency, or add constraints and impair accuracy. In response, our insight is to reorder channels at export time, (1) reducing latency by reducing memory copies and (2) improving accuracy by removing constraints. Using this insight, we design a generic algorithm UPSCALE to prune models with any pruning pattern. By removing constraints from existing pruners, we improve ImageNet accuracy for post-training pruned models by 2.1 points on average -- benefiting DenseNet (+16.9), EfficientNetV2 (+7.9), and ResNet (+6.2). Furthermore, by reordering channels, UPSCALE improves inference speeds by up to 2x over a baseline export.

* 29 pages, 26 figures, accepted to ICML 2023

Via

Access Paper or Ask Questions

Improving Building Segmentation for Off-Nadir Satellite Imagery

Sep 08, 2021

Hanxiang Hao, Sriram Baireddy, Kevin LaTourette, Latisha Konz, Moses Chan, Mary L. Comer, Edward J. Delp

Figure 1 for Improving Building Segmentation for Off-Nadir Satellite Imagery

Figure 2 for Improving Building Segmentation for Off-Nadir Satellite Imagery

Figure 3 for Improving Building Segmentation for Off-Nadir Satellite Imagery

Figure 4 for Improving Building Segmentation for Off-Nadir Satellite Imagery

Abstract:Automatic building segmentation is an important task for satellite imagery analysis and scene understanding. Most existing segmentation methods focus on the case where the images are taken from directly overhead (i.e., low off-nadir/viewing angle). These methods often fail to provide accurate results on satellite images with larger off-nadir angles due to the higher noise level and lower spatial resolution. In this paper, we propose a method that is able to provide accurate building segmentation for satellite imagery captured from a large range of off-nadir angles. Based on Bayesian deep learning, we explicitly design our method to learn the data noise via aleatoric and epistemic uncertainty modeling. Satellite image metadata (e.g., off-nadir angle and ground sample distance) is also used in our model to further improve the result. We show that with uncertainty modeling and metadata injection, our method achieves better performance than the baseline method, especially for noisy images taken from large off-nadir angles.

* This is an extended version of our ACM SIGSPATIAL'21 conference paper

Via

Access Paper or Ask Questions

Manipulation Detection in Satellite Images Using Vision Transformer

May 13, 2021

János Horváth, Sriram Baireddy, Hanxiang Hao, Daniel Mas Montserrat, Edward J. Delp

Figure 1 for Manipulation Detection in Satellite Images Using Vision Transformer

Figure 2 for Manipulation Detection in Satellite Images Using Vision Transformer

Figure 3 for Manipulation Detection in Satellite Images Using Vision Transformer

Figure 4 for Manipulation Detection in Satellite Images Using Vision Transformer

Abstract:A growing number of commercial satellite companies provide easily accessible satellite imagery. Overhead imagery is used by numerous industries including agriculture, forestry, natural disaster analysis, and meteorology. Satellite images, just as any other images, can be tampered with image manipulation tools. Manipulation detection methods created for images captured by "consumer cameras" tend to fail when used on satellite images due to the differences in image sensors, image acquisition, and processing. In this paper we propose an unsupervised technique that uses a Vision Transformer to detect spliced areas within satellite images. We introduce a new dataset which includes manipulated satellite images that contain spliced objects. We show that our proposed approach performs better than existing unsupervised splicing detection techniques.

Via

Access Paper or Ask Questions

FaR-GAN for One-Shot Face Reenactment

May 13, 2020

Hanxiang Hao, Sriram Baireddy, Amy R. Reibman, Edward J. Delp

Figure 1 for FaR-GAN for One-Shot Face Reenactment

Figure 2 for FaR-GAN for One-Shot Face Reenactment

Figure 3 for FaR-GAN for One-Shot Face Reenactment

Figure 4 for FaR-GAN for One-Shot Face Reenactment

Abstract:Animating a static face image with target facial expressions and movements is important in the area of image editing and movie production. This face reenactment process is challenging due to the complex geometry and movement of human faces. Previous work usually requires a large set of images from the same person to model the appearance. In this paper, we present a one-shot face reenactment model, FaR-GAN, that takes only one face image of any given source identity and a target expression as input, and then produces a face image of the same source identity but with the target expression. The proposed method makes no assumptions about the source identity, facial expression, head pose, or even image background. We evaluate our method on the VoxCeleb1 dataset and show that our method is able to generate a higher quality face image than the compared methods.

* This paper has been accepted to the AI for content creation workshop at CVPR 2020

Via

Access Paper or Ask Questions

Deepfakes Detection with Automatic Face Weighting

May 04, 2020

Daniel Mas Montserrat, Hanxiang Hao, S. K. Yarlagadda, Sriram Baireddy, Ruiting Shao, János Horváth, Emily Bartusiak, Justin Yang, David Güera, Fengqing Zhu(+1 more)

Figure 1 for Deepfakes Detection with Automatic Face Weighting

Figure 2 for Deepfakes Detection with Automatic Face Weighting

Figure 3 for Deepfakes Detection with Automatic Face Weighting

Figure 4 for Deepfakes Detection with Automatic Face Weighting

Abstract:Altered and manipulated multimedia is increasingly present and widely distributed via social media platforms. Advanced video manipulation tools enable the generation of highly realistic-looking altered multimedia. While many methods have been presented to detect manipulations, most of them fail when evaluated with data outside of the datasets used in research environments. In order to address this problem, the Deepfake Detection Challenge (DFDC) provides a large dataset of videos containing realistic manipulations and an evaluation system that ensures that methods work quickly and accurately, even when faced with challenging data. In this paper, we introduce a method based on convolutional neural networks (CNNs) and recurrent neural networks (RNNs) that extracts visual and temporal features from faces present in videos to accurately detect manipulations. The method is evaluated with the DFDC dataset, providing competitive results compared to other techniques.

Via

Access Paper or Ask Questions

An Attention-Based System for Damage Assessment Using Satellite Imagery

Apr 14, 2020

Hanxiang Hao, Sriram Baireddy, Emily R. Bartusiak, Latisha Konz, Kevin LaTourette, Michael Gribbons, Moses Chan, Mary L. Comer, Edward J. Delp

Figure 1 for An Attention-Based System for Damage Assessment Using Satellite Imagery

Figure 2 for An Attention-Based System for Damage Assessment Using Satellite Imagery

Figure 3 for An Attention-Based System for Damage Assessment Using Satellite Imagery

Figure 4 for An Attention-Based System for Damage Assessment Using Satellite Imagery

Abstract:When disaster strikes, accurate situational information and a fast, effective response are critical to save lives. Widely available, high resolution satellite images enable emergency responders to estimate locations, causes, and severity of damage. Quickly and accurately analyzing the extensive amount of satellite imagery available, though, requires an automatic approach. In this paper, we present Siam-U-Net-Attn model - a multi-class deep learning model with an attention mechanism - to assess damage levels of buildings given a pair of satellite images depicting a scene before and after a disaster. We evaluate the proposed method on xView2, a large-scale building damage assessment dataset, and demonstrate that the proposed approach achieves accurate damage scale classification and building segmentation results simultaneously.

* 10 pages, 9 figures

Via

Access Paper or Ask Questions

A Utility-Preserving GAN for Face Obscuration

Jun 27, 2019

Hanxiang Hao, David Güera, Amy R. Reibman, Edward J. Delp

Figure 1 for A Utility-Preserving GAN for Face Obscuration

Figure 2 for A Utility-Preserving GAN for Face Obscuration

Figure 3 for A Utility-Preserving GAN for Face Obscuration

Figure 4 for A Utility-Preserving GAN for Face Obscuration

Abstract:From TV news to Google StreetView, face obscuration has been used for privacy protection. Due to recent advances in the field of deep learning, obscuration methods such as Gaussian blurring and pixelation are not guaranteed to conceal identity. In this paper, we propose a utility-preserving generative model, UP-GAN, that is able to provide an effective face obscuration, while preserving facial utility. By utility-preserving we mean preserving facial features that do not reveal identity, such as age, gender, skin tone, pose, and expression. We show that the proposed method achieves the best performance in terms of obscuration and utility preservation.

* 6 pages, 5 figures, presented at the ICML 2019 Worksop on Synthetic Realities: Deep Learning for Detecting AudioVisual Fakes

Via

Access Paper or Ask Questions

Robustness Analysis of Face Obscuration

May 13, 2019

Hanxiang Hao, David Güera, Amy R. Reibman, Edward J. Delp

Figure 1 for Robustness Analysis of Face Obscuration

Figure 2 for Robustness Analysis of Face Obscuration

Figure 3 for Robustness Analysis of Face Obscuration

Figure 4 for Robustness Analysis of Face Obscuration

Abstract:Face obscuration is often needed by law enforcement or mass media outlets to provide privacy protection. Sharing sensitive content where the obscuration or redaction technique may have failed to completely remove all identifiable traces can lead to life-threatening consequences. Hence, it is critical to be able to systematically measure the face obscuration performance of a given technique. In this paper we propose to measure the effectiveness of three obscuration techniques: Gaussian blurring, median blurring, and pixelation. We do so by identifying the redacted faces under two scenarios: classifying an obscured face into a group of identities and comparing the similarity of an obscured face with a clear face. Threat modeling is also considered to provide a vulnerability analysis for each studied obscuration technique. Based on our evaluation, we show that pixelation-based face obscuration approaches are the most effective.

Via

Access Paper or Ask Questions