Jun Han

DeTrigger: A Gradient-Centric Approach to Backdoor Attack Mitigation in Federated Learning

Nov 19, 2024

Kichang Lee, Yujin Shin, Jonghyuk Yun, Jun Han, JeongGil Ko

Abstract:Federated Learning (FL) enables collaborative model training across distributed devices while preserving local data privacy, making it ideal for mobile and embedded systems. However, the decentralized nature of FL also opens vulnerabilities to model poisoning attacks, particularly backdoor attacks, where adversaries implant trigger patterns to manipulate model predictions. In this paper, we propose DeTrigger, a scalable and efficient backdoor-robust federated learning framework that leverages insights from adversarial attack methodologies. By employing gradient analysis with temperature scaling, DeTrigger detects and isolates backdoor triggers, allowing for precise model weight pruning of backdoor activations without sacrificing benign model knowledge. Extensive evaluations across four widely used datasets demonstrate that DeTrigger achieves up to 251x faster detection than traditional methods and mitigates backdoor attacks by up to 98.9%, with minimal impact on global model accuracy. Our findings establish DeTrigger as a robust and scalable solution to protect federated learning environments against sophisticated backdoor threats.

* 14 pages
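The abstract above mentions temperature scaling but does not say where it enters the DeTrigger pipeline. As a generic illustration only (not the authors' method), the sketch below shows what temperature scaling does to a softmax output: dividing the logits by a temperature `T` before normalizing sharpens the distribution for `T < 1` and flattens it for `T > 1`. The function name and the example logits are hypothetical.

```python
import math

def softmax_with_temperature(logits, temperature=1.0):
    """Softmax of logits / temperature (generic operation, not DeTrigger itself)."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [z / temperature for z in logits]
    # Subtract the maximum before exponentiating for numerical stability.
    peak = max(scaled)
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical logits for a three-class model.
logits = [2.0, 1.0, 0.1]
sharp = softmax_with_temperature(logits, temperature=0.5)  # more peaked
soft = softmax_with_temperature(logits, temperature=5.0)   # closer to uniform
```

A flatter output distribution changes the magnitude of gradients taken with respect to the input, which is one reason temperature is often tuned in gradient-based analyses; how DeTrigger uses it is left to the paper.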

Guided Discrete Diffusion for Electronic Health Record Generation

Apr 18, 2024

Zixiang Chen, Jun Han, Yongqian Li, Yiwen Kou, Eran Halperin, Robert E. Tillman, Quanquan Gu

Abstract:Electronic health records (EHRs) are a pivotal data source that enables numerous applications in computational medicine, e.g., disease progression prediction, clinical trial design, and health economics and outcomes research. Despite wide usability, their sensitive nature raises privacy and confidentiality concerns, which limit potential use cases. To tackle these challenges, we explore the use of generative models to synthesize artificial, yet realistic EHRs. While diffusion-based methods have recently demonstrated state-of-the-art performance in generating other data modalities and overcome the training instability and mode collapse issues that plague previous GAN-based approaches, their applications in EHR generation remain underexplored. The discrete nature of tabular medical code data in EHRs poses challenges for high-quality data generation, especially for continuous diffusion models. To this end, we introduce a novel tabular EHR generation method, EHR-D3PM, which enables both unconditional and conditional generation using the discrete diffusion model. Our experiments demonstrate that EHR-D3PM significantly outperforms existing generative baselines on comprehensive fidelity and utility metrics while maintaining lower membership vulnerability risk. Furthermore, we show EHR-D3PM is effective as a data augmentation method and enhances performance on downstream tasks when combined with real data.

* 24 pages, 9 figures, 12 tables

SingVisio: Visual Analytics of Diffusion Model for Singing Voice Conversion

Feb 20, 2024

Liumeng Xue, Chaoren Wang, Mingxuan Wang, Xueyao Zhang, Jun Han, Zhizheng Wu

Abstract:In this study, we present SingVisio, an interactive visual analysis system that aims to explain the diffusion model used in singing voice conversion. SingVisio provides a visual display of the generation process in diffusion models, showcasing the step-by-step denoising of the noisy spectrum and its transformation into a clean spectrum that captures the desired singer's timbre. The system also facilitates side-by-side comparisons of different conditions, such as source content, melody, and target timbre, highlighting the impact of these conditions on the diffusion generation process and resulting conversions. Through comprehensive evaluations, SingVisio demonstrates its effectiveness in terms of system design, functionality, explainability, and user-friendliness. It offers users of various backgrounds valuable learning experiences and insights into the diffusion model for singing voice conversion.

Amphion: An Open-Source Audio, Music and Speech Generation Toolkit

Dec 15, 2023

Xueyao Zhang, Liumeng Xue, Yuancheng Wang, Yicheng Gu, Xi Chen, Zihao Fang, Haopeng Chen, Lexiao Zou, Chaoren Wang, Jun Han (+3 more)

Abstract:Amphion is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development. Amphion offers a unique feature: visualizations of classic models or architectures. We believe that these visualizations are beneficial for junior researchers and engineers who wish to gain a better understanding of the model. The North-Star objective of Amphion is to offer a platform for studying the conversion of any inputs into general audio. Amphion is designed to support individual generation tasks. In addition to the specific generation tasks, Amphion also includes several vocoders and evaluation metrics. A vocoder is an important module for producing high-quality audio signals, while evaluation metrics are critical for ensuring consistent metrics in generation tasks. In this paper, we provide a high-level overview of Amphion.

* GitHub: https://github.com/open-mmlab/Amphion

Contextual Object Detection with Multimodal Large Language Models

May 29, 2023

Yuhang Zang, Wei Li, Jun Han, Kaiyang Zhou, Chen Change Loy

Abstract:Recent Multimodal Large Language Models (MLLMs) are remarkable in vision-language tasks, such as image captioning and question answering, but lack the essential perception ability, i.e., object detection. In this work, we address this limitation by introducing a novel research problem of contextual object detection -- understanding visible objects within different human-AI interactive contexts. Three representative scenarios are investigated, including the language cloze test, visual captioning, and question answering. Moreover, we present ContextDET, a unified multimodal model that is capable of end-to-end differentiable modeling of visual-language contexts, so as to locate, identify, and associate visual objects with language inputs for human-AI interaction. Our ContextDET involves three key submodels: (i) a visual encoder for extracting visual representations, (ii) a pre-trained LLM for multimodal context decoding, and (iii) a visual decoder for predicting bounding boxes given contextual object words. The new generate-then-detect framework enables us to detect object words within human vocabulary. Extensive experiments show the advantages of ContextDET on our proposed CODE benchmark, open-vocabulary detection, and referring image segmentation. Github: https://github.com/yuhangzang/ContextDET.

* Github: https://github.com/yuhangzang/ContextDET, Project Page: https://www.mmlab-ntu.com/project/contextdet/index.html

Projected Gradient Descent Algorithms for Solving Nonlinear Inverse Problems with Generative Priors

Sep 21, 2022

Zhaoqiang Liu, Jun Han

Abstract:In this paper, we propose projected gradient descent (PGD) algorithms for signal estimation from noisy nonlinear measurements. We assume that the unknown $p$-dimensional signal lies near the range of an $L$-Lipschitz continuous generative model with bounded $k$-dimensional inputs. In particular, we consider two cases when the nonlinear link function is either unknown or known. For unknown nonlinearity, similarly to \cite{liu2020generalized}, we make the assumption of sub-Gaussian observations and propose a linear least-squares estimator. We show that when there is no representation error and the sensing vectors are Gaussian, roughly $O(k \log L)$ samples suffice to ensure that a PGD algorithm converges linearly to a point achieving the optimal statistical rate using arbitrary initialization. For known nonlinearity, we assume monotonicity as in \cite{yang2016sparse}, and make much weaker assumptions on the sensing vectors and allow for representation error. We propose a nonlinear least-squares estimator that is guaranteed to enjoy an optimal statistical rate. A corresponding PGD algorithm is provided and is shown to also converge linearly to the estimator using arbitrary initialization. In addition, we present experimental results on image datasets to demonstrate the performance of our PGD algorithms.
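The projected gradient descent iteration named in the abstract above can be sketched on a toy problem. This is a simplified illustration under stated assumptions, not the paper's estimator: the measurements are noiseless and linear, and the "generative model" is a hypothetical linear map with orthonormal columns and inputs bounded in norm by `radius`, so the projection onto its range has a closed form. All names and the matrix `A` are made up for the example.

```python
import math

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def matvec(A, x):
    return [dot(row, x) for row in A]

def rmatvec(A, r):
    # Computes A^T r.
    return [sum(A[i][j] * r[i] for i in range(len(A))) for j in range(len(A[0]))]

def project(w, basis, radius):
    """Project w onto {B z : ||z|| <= radius}; basis rows are orthonormal (assumption)."""
    coeffs = [dot(b, w) for b in basis]
    norm = math.sqrt(dot(coeffs, coeffs))
    if norm > radius:
        coeffs = [c * radius / norm for c in coeffs]
    return [sum(c * b[j] for c, b in zip(coeffs, basis)) for j in range(len(w))]

def loss(A, x, y):
    r = [p - q for p, q in zip(matvec(A, x), y)]
    return 0.5 * dot(r, r)

def pgd(A, y, basis, radius, iters=200):
    # Step 1 / ||A||_F^2 is no larger than 1 / ||A||_2^2, so the loss cannot increase.
    step = 1.0 / sum(a * a for row in A for a in row)
    x = [0.0] * len(A[0])  # the zero vector lies in the range
    losses = [loss(A, x, y)]
    for _ in range(iters):
        residual = [p - q for p, q in zip(matvec(A, x), y)]
        grad = rmatvec(A, residual)
        x = project([xi - step * g for xi, g in zip(x, grad)], basis, radius)
        losses.append(loss(A, x, y))
    return x, losses

s = 1 / math.sqrt(2)
basis = [[s, s, 0.0, 0.0], [0.0, 0.0, s, s]]    # hypothetical range, k = 2, p = 4
x_true = [0.3 * s, 0.3 * s, 0.4 * s, 0.4 * s]   # lies in the range
A = [[1.0, 0.0, 0.0, 0.0], [0.0, 0.0, 1.0, 0.0], [1.0, 1.0, 0.0, 0.0]]
y = matvec(A, x_true)
x_hat, losses = pgd(A, y, basis, radius=1.0)
```

With only three measurements of a four-dimensional signal, plain least squares is underdetermined; the projection step is what restricts the search to the two-dimensional range and makes recovery possible in this toy setting.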

DL4SciVis: A State-of-the-Art Survey on Deep Learning for Scientific Visualization

Apr 13, 2022

Chaoli Wang, Jun Han

Abstract:Since 2016, we have witnessed the tremendous growth of artificial intelligence+visualization (AI+VIS) research. However, existing survey papers on AI+VIS focus on visual analytics and information visualization, not scientific visualization (SciVis). In this paper, we survey related deep learning (DL) works in SciVis, specifically in the direction of DL4SciVis: designing DL solutions for solving SciVis problems. To stay focused, we primarily consider works that handle scalar and vector field data but exclude mesh data. We classify and discuss these works along six dimensions: domain setting, research task, learning type, network architecture, loss function, and evaluation metric. The paper concludes with a discussion of the remaining gaps to fill along the discussed dimensions and the grand challenges we need to tackle as a community. This state-of-the-art survey guides SciVis researchers in gaining an overview of this emerging topic and points out future directions to grow this research.

* 20 pages, 2 figures, and 12 tables. To Appear in IEEE Transactions on Visualization and Computer Graphics

Generative Principal Component Analysis

Mar 18, 2022

Zhaoqiang Liu, Jiulong Liu, Subhroshekhar Ghosh, Jun Han, Jonathan Scarlett

Abstract:In this paper, we study the problem of principal component analysis with generative modeling assumptions, adopting a general model for the observed matrix that encompasses notable special cases, including spiked matrix recovery and phase retrieval. The key assumption is that the underlying signal lies near the range of an $L$-Lipschitz continuous generative model with bounded $k$-dimensional inputs. We propose a quadratic estimator, and show that it enjoys a statistical rate of order $\sqrt{\frac{k\log L}{m}}$, where $m$ is the number of samples. We also provide a near-matching algorithm-independent lower bound. Moreover, we provide a variant of the classic power method, which projects the calculated data onto the range of the generative model during each iteration. We show that under suitable conditions, this method converges exponentially fast to a point achieving the above-mentioned statistical rate. We perform experiments on various image datasets for spiked matrix and phase retrieval models, and illustrate performance gains of our method over the classic power method and the truncated power method devised for sparse principal component analysis.

* ICLR 2022 paper + additional appendix on algorithm-independent lower bounds
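The projected power method described in the abstract above alternates a matrix-vector product with a projection onto the range of the generative model. The sketch below is a minimal illustration under strong assumptions, not the authors' implementation: the range is taken to be the unit-norm vectors in a hypothetical two-dimensional subspace (so the projection is exact), and the observed matrix is a noiseless spiked matrix `4 x x^T + I`. All names are made up for the example.

```python
import math

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def matvec(M, v):
    return [dot(row, v) for row in M]

def project_onto_range(w, basis):
    """Project w onto span(basis), then normalize; basis rows are orthonormal (assumption)."""
    coeffs = [dot(b, w) for b in basis]
    p = [sum(c * b[i] for c, b in zip(coeffs, basis)) for i in range(len(w))]
    norm = math.sqrt(dot(p, p))
    return [pi / norm for pi in p]

def projected_power_method(V, basis, w0, iters=30):
    w = project_onto_range(w0, basis)
    for _ in range(iters):
        # Classic power step followed by projection onto the (toy) range.
        w = project_onto_range(matvec(V, w), basis)
    return w

s = 1 / math.sqrt(2)
basis = [[s, s, 0.0, 0.0], [0.0, 0.0, s, s]]   # hypothetical range of the model
x = [0.6 * s, 0.6 * s, 0.8 * s, 0.8 * s]        # unit-norm signal in the range
V = [[4 * x[i] * x[j] + (1.0 if i == j else 0.0) for j in range(4)] for i in range(4)]

w = projected_power_method(V, basis, w0=[1.0, 0.0, 0.0, 0.0])
alignment = dot(w, x)  # cosine similarity, since both vectors have unit norm
```

For a real generative model the projection has no closed form and is typically approximated by optimizing over the latent input, which is where most of the practical cost would lie.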

Robust 1-bit Compressive Sensing with Partial Gaussian Circulant Matrices and Generative Priors

Aug 08, 2021

Zhaoqiang Liu, Subhroshekhar Ghosh, Jun Han, Jonathan Scarlett

Abstract:In 1-bit compressive sensing, each measurement is quantized to a single bit, namely the sign of a linear function of an unknown vector, and the goal is to accurately recover the vector. While it is most popular to assume a standard Gaussian sensing matrix for 1-bit compressive sensing, using structured sensing matrices such as partial Gaussian circulant matrices is of significant practical importance due to their faster matrix operations. In this paper, we provide recovery guarantees for a correlation-based optimization algorithm for robust 1-bit compressive sensing with randomly signed partial Gaussian circulant matrices and generative models. Under suitable assumptions, we match guarantees that were previously only known to hold for i.i.d. Gaussian matrices that require significantly more computation. We make use of a practical iterative algorithm, and perform numerical experiments on image datasets to corroborate our theoretical results.

TS4Net: Two-Stage Sample Selective Strategy for Rotating Object Detection

Aug 06, 2021

Kai Feng, Weixing Li, Jun Han, Feng Pan, Dongdong Zheng

Abstract:Rotating object detection has wide applications in aerial photographs, remote sensing images, UAVs, etc. At present, most rotating object detection datasets focus on the field of remote sensing, and these images are usually shot in high-altitude scenes. However, image datasets captured at low altitude, such as drone-based datasets, also deserve attention. We therefore present a low-altitude drone-based dataset, named UAV-ROD, aiming to promote research and development in rotating object detection and UAV applications. UAV-ROD consists of 1577 images and 30,090 instances of the car category annotated by oriented bounding boxes. In particular, UAV-ROD can be utilized for rotating object detection, vehicle orientation recognition, and object counting tasks. Compared with horizontal object detection, the regression stage of rotation detection is a tricky problem. In this paper, we propose a rotating object detector, TS4Net, which contains an anchor refinement module (ARM) and a two-stage sample selective strategy (TS4). The ARM can convert preset horizontal anchors into high-quality rotated anchors through two-stage anchor refinement. The TS4 module utilizes different constrained sample selective strategies to allocate positive and negative samples, which is adaptive to the regression task in different stages. Benefiting from the ARM and TS4, TS4Net can achieve superior performance for rotating object detection with only one preset horizontal anchor. Extensive experimental results on the UAV-ROD dataset and three remote sensing datasets (DOTA, HRSC2016, and UCAS-AOD) demonstrate that our method achieves competitive performance against most state-of-the-art methods.

* 12 pages, 11 figures
