Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Hongliang Zhang

Tree-Mamba: A Tree-Aware Mamba for Underwater Monocular Depth Estimation

Jul 10, 2025

Peixian Zhuang, Yijian Wang, Zhenqi Fu, Hongliang Zhang, Sam Kwong, Chongyi Li

Abstract:Underwater Monocular Depth Estimation (UMDE) is a critical task that aims to estimate high-precision depth maps from underwater degraded images caused by light absorption and scattering effects in marine environments. Recently, Mamba-based methods have achieved promising performance across various vision tasks; however, they struggle with the UMDE task because their inflexible state scanning strategies fail to model the structural features of underwater images effectively. Meanwhile, existing UMDE datasets usually contain unreliable depth labels, leading to incorrect object-depth relationships between underwater images and their corresponding depth maps. To overcome these limitations, we develop a novel tree-aware Mamba method, dubbed Tree-Mamba, for estimating accurate monocular depth maps from underwater degraded images. Specifically, we propose a tree-aware scanning strategy that adaptively constructs a minimum spanning tree based on feature similarity. The spatial topological features among the tree nodes are then flexibly aggregated through bottom-up and top-down traversals, enabling stronger multi-scale feature representation capabilities. Moreover, we construct an underwater depth estimation benchmark (called BlueDepth), which consists of 38,162 underwater image pairs with reliable depth labels. This benchmark serves as a foundational dataset for training existing deep learning-based UMDE methods to learn accurate object-depth relationships. Extensive experiments demonstrate the superiority of the proposed Tree-Mamba over several leading methods in both qualitative results and quantitative evaluations with competitive computational efficiency. Code and dataset will be available at https://wyjgr.github.io/Tree-Mamba.html.

Via

Access Paper or Ask Questions

Simultaneously Exposing and Jamming Covert Communications via Disco Reconfigurable Intelligent Surfaces

May 18, 2025

Huan Huang, Hongliang Zhang, Yi Cai, Dusit Niyato, A. Lee Swindlehurst, Zhu Han

Abstract:Covert communications provide a stronger privacy protection than cryptography and physical-layer security (PLS). However, previous works on covert communications have implicitly assumed the validity of channel reciprocity, i.e., wireless channels remain constant or approximately constant during their coherence time. In this work, we investigate covert communications in the presence of a disco RIS (DRIS) deployed by the warden Willie, where the DRIS with random and time-varying reflective coefficients acts as a "disco ball", introducing timevarying fully-passive jamming (FPJ). Consequently, the channel reciprocity assumption no longer holds. The DRIS not only jams the covert transmissions between Alice and Bob, but also decreases the error probabilities of Willie's detections, without either Bob's channel knowledge or additional jamming power. To quantify the impact of the DRIS on covert communications, we first design a detection rule for the warden Willie in the presence of time-varying FPJ introduced by the DRIS. Then, we define the detection error probabilities, i.e., the false alarm rate (FAR) and the missed detection rate (MDR), as the monitoring performance metrics for Willie's detections, and the signal-to-jamming-plusnoise ratio (SJNR) as a communication performance metric for the covert transmissions between Alice and Bob. Based on the detection rule, we derive the detection threshold for the warden Willie to detect whether communications between Alice and Bob is ongoing, considering the time-varying DRIS-based FPJ. Moreover, we conduct theoretical analyses of the FAR and the MDR at the warden Willie, as well as SJNR at Bob, and then present unique properties of the DRIS-based FPJ in covert communications. We present numerical results to validate the derived theoretical analyses and evaluate the impact of DRIS on covert communications.

* This paper has been submitted for publication

Via

Access Paper or Ask Questions

Building a Human-Verified Clinical Reasoning Dataset via a Human LLM Hybrid Pipeline for Trustworthy Medical AI

May 11, 2025

Chao Ding, Mouxiao Bian, Pengcheng Chen, Hongliang Zhang, Tianbin Li, Lihao Liu, Jiayuan Chen, Zhuoran Li, Yabei Zhong, Yongqi Liu(+4 more)

Abstract:Despite strong performance in medical question-answering, the clinical adoption of Large Language Models (LLMs) is critically hampered by their opaque 'black-box' reasoning, limiting clinician trust. This challenge is compounded by the predominant reliance of current medical LLMs on corpora from scientific literature or synthetic data, which often lack the granular expert validation and high clinical relevance essential for advancing their specialized medical capabilities. To address these critical gaps, we introduce a highly clinically relevant dataset with 31,247 medical question-answer pairs, each accompanied by expert-validated chain-of-thought (CoT) explanations. This resource, spanning multiple clinical domains, was curated via a scalable human-LLM hybrid pipeline: LLM-generated rationales were iteratively reviewed, scored, and refined by medical experts against a structured rubric, with substandard outputs revised through human effort or guided LLM regeneration until expert consensus. This publicly available dataset provides a vital source for the development of medical LLMs that capable of transparent and verifiable reasoning, thereby advancing safer and more interpretable AI in medicine.

Via

Access Paper or Ask Questions

WiFi-Diffusion: Achieving Fine-Grained WiFi Radio Map Estimation With Ultra-Low Sampling Rate by Diffusion Models

Mar 15, 2025

Zhiyuan Liu, Shuhang Zhang, Qingyu Liu, Hongliang Zhang, Lingyang Song

Abstract:Fine-grained radio map presents communication parameters of interest, e.g., received signal strength, at every point across a large geographical region. It can be leveraged to improve the efficiency of spectrum utilization for a large area, particularly critical for the unlicensed WiFi spectrum. The problem of fine-grained radio map estimation is to utilize radio samples collected by sparsely distributed sensors to infer the map. This problem is challenging due to the ultra-low sampling rate, where the number of available samples is far less than the fine-grained resolution required for radio map estimation. We propose WiFi-Diffusion -- a novel generative framework for achieving fine-grained WiFi radio map estimation using diffusion models. WiFi-Diffusion employs the creative power of generative AI to address the ultra-low sampling rate challenge and consists of three blocks: 1) a boost block, using prior information such as the layout of obstacles to optimize the diffusion model; 2) a generation block, leveraging the diffusion model to generate a candidate set of radio maps; and 3) an election block, utilizing the radio propagation model as a guide to find the best radio map from the candidate set. Extensive simulations demonstrate that 1) the fine-grained radio map generated by WiFi-Diffusion is ten times better than those produced by state-of-the-art (SOTA) when they use the same ultra-low sampling rate; and 2) WiFi-Diffusion achieves comparable fine-grained radio map quality with only one-fifth of the sampling rate required by SOTA.

Via

Access Paper or Ask Questions

Simultaneous Beamforming and Anti-Jamming With Intelligent Omni-Surfaces

Feb 04, 2025

Yuhan Wang, Shuhao Zeng, Qingyu Liu, Boya Di, Hongliang Zhang

Abstract:Wireless transmission is vulnerable to malicious jamming attacks due to the openness of wireless channels, posing a severe threat to wireless communications. Current anti-jamming studies primarily focus on either enhancing desired signals or mitigating jamming, resulting in limited performance. To address this issue, intelligent omni-surface (IOS) is a promising solution. By jointly designing its reflective and refractive properties, the IOS can simultaneously nullify jamming and enhance desired signals. In this paper, we consider an IOS-aided multi-user anti-jamming communication system, aiming to improve desired signals and nullify jamming by optimizing IOS phase shifts and transmit beamforming. However, this is challenging due to the coupled and discrete IOS reflection and refraction phase shifts, the unknown jammer's beamformer, and imperfect jammer-related channel state information. To tackle this, we relax IOS phase shifts to continuous states and optimize with a coupling-aware algorithm using the Cauchy-Schwarz inequality and S-procedure, followed by a local search to recover discrete states. Simulation results show that the proposed scheme significantly improves the sum rate amid jamming attacks.

Via

Access Paper or Ask Questions

Beamforming Design for Wideband Near-Field Communications With Reconfigurable Refractive Surfaces

Jan 02, 2025

Zicheng Lin, Shuhao Zeng, Aryan Kaushik, Hongliang Zhang

$Figure 1 for Beamforming Design for Wideband Near-Field Communications With Reconfigurable Refractive Surfaces$

$Figure 2 for Beamforming Design for Wideband Near-Field Communications With Reconfigurable Refractive Surfaces$

$Figure 3 for Beamforming Design for Wideband Near-Field Communications With Reconfigurable Refractive Surfaces$

$Figure 4 for Beamforming Design for Wideband Near-Field Communications With Reconfigurable Refractive Surfaces$

Abstract:To meet the growing demand for high data rates, cellular systems are expected to evolve towards higher carrier frequencies and larger antenna arrays, but conventional phased arrays face challenges in supporting such a prospection due to their excessive power consumption induced by numerous phase shifters required. Reconfigurable Refractive Surface (RRS) is an energy efficient solution to address this issue without relying on phase shifters. However, the increased radiation aperture size extends the range of the Fresnel region, leading the users to lie in the near-field zone. Moreover, given the wideband communications in higher frequency bands, we cannot ignore the frequency selectivity of the RRS. These two effects collectively exacerbate the beam-split issue, where different frequency components fail to converge on the user simultaneously, and finally result in a degradation of the data rate. In this paper, we investigate an RRS-based wideband near-field multi-user communication system. Unlike most existing studies on wideband communications, which consider the beam-split effect only with the near-field condition, we study the beam-split effect under the influence of both the near-field condition and the frequency selectivity of the RRS. To mitigate the beam-split effect, we propose a Delayed-RRS structure, based on which a beamforming scheme is proposed to optimize the user's data rate. Through theoretical analysis and simulation results, we analyze the influence of the RRS's frequency selectivity, demonstrate the effectiveness of the proposed beamforming scheme, and reveal the importance of jointly considering the near-field condition and the frequency selectivity of RRS.

* 13 pages, 10 figures

Via

Access Paper or Ask Questions

Towards Better Spherical Sliced-Wasserstein Distance Learning with Data-Adaptive Discriminative Projection Direction

Dec 26, 2024

Hongliang Zhang, Shuo Chen, Lei Luo, Jian Yang

Abstract:Spherical Sliced-Wasserstein (SSW) has recently been proposed to measure the discrepancy between spherical data distributions in various fields, such as geology, medical domains, computer vision, and deep representation learning. However, in the original SSW, all projection directions are treated equally, which is too idealistic and cannot accurately reflect the importance of different projection directions for various data distributions. To address this issue, we propose a novel data-adaptive Discriminative Spherical Sliced-Wasserstein (DSSW) distance, which utilizes a projected energy function to determine the discriminative projection direction for SSW. In our new DSSW, we introduce two types of projected energy functions to generate the weights for projection directions with complete theoretical guarantees. The first type employs a non-parametric deterministic function that transforms the projected Wasserstein distance into its corresponding weight in each projection direction. This improves the performance of the original SSW distance with negligible additional computational overhead. The second type utilizes a neural network-induced function that learns the projection direction weight through a parameterized neural network based on data projections. This further enhances the performance of the original SSW distance with less extra computational overhead. Finally, we evaluate the performance of our proposed DSSW by comparing it with several state-of-the-art methods across a variety of machine learning tasks, including gradient flows, density estimation on real earth data, and self-supervised learning.

* Accepted by AAAI 2025

Via

Access Paper or Ask Questions

Disco Intelligent Omni-Surfaces: 360-degree Fully-Passive Jamming Attacks

Nov 20, 2024

Huan Huang, Hongliang Zhang, Jide Yuan, Luyao Sun, Yitian Wang, Weidong Mei, Boya Di, Yi Cai, Zhu Han

Figure 1 for Disco Intelligent Omni-Surfaces: 360-degree Fully-Passive Jamming Attacks

Figure 2 for Disco Intelligent Omni-Surfaces: 360-degree Fully-Passive Jamming Attacks

Figure 3 for Disco Intelligent Omni-Surfaces: 360-degree Fully-Passive Jamming Attacks

Figure 4 for Disco Intelligent Omni-Surfaces: 360-degree Fully-Passive Jamming Attacks

Abstract:Intelligent omni-surfaces (IOSs) with 360-degree electromagnetic radiation significantly improves the performance of wireless systems, while an adversarial IOS also poses a significant potential risk for physical layer security. In this paper, we propose a "DISCO" IOS (DIOS) based fully-passive jammer (FPJ) that can launch omnidirectional fully-passive jamming attacks. In the proposed DIOS-based FPJ, the interrelated refractive and reflective (R&R) coefficients of the adversarial IOS are randomly generated, acting like a "DISCO" that distributes wireless energy radiated by the base station. By introducing active channel aging (ACA) during channel coherence time, the DIOS-based FPJ can perform omnidirectional fully-passive jamming without neither jamming power nor channel knowledge of legitimate users (LUs). To characterize the impact of the DIOS-based PFJ, we derive the statistical characteristics of DIOS-jammed channels based on two widely-used IOS models, i.e., the constant-amplitude model and the variable-amplitude model. Consequently, the asymptotic analysis of the ergodic achievable sum rates under the DIOS-based omnidirectional fully-passive jamming is given based on the derived stochastic characteristics for both the two IOS models. Based on the derived analysis, the omnidirectional jamming impact of the proposed DIOS-based FPJ implemented by a constant-amplitude IOS does not depend on either the quantization number or the stochastic distribution of the DIOS coefficients, while the conclusion does not hold on when a variable-amplitude IOS is used. Numerical results based on one-bit quantization of the IOS phase shifts are provided to verify the effectiveness of the derived theoretical analysis. The proposed DIOS-based FPJ can not only launch omnidirectional fully-passive jamming, but also improve the jamming impact by about 55% at 10 dBm transmit power per LU.

* This paper has been submitted to IEEE TWC for possible publication

Via

Access Paper or Ask Questions

RIS-Aided Dual-Polarized MIMO: How Large a Surface is Needed to Beat Single Polarization?

Oct 30, 2024

Zizhou Zheng, Huan Huang, Hongliang Zhang, A. Lee Swindlehurst

Figure 1 for RIS-Aided Dual-Polarized MIMO: How Large a Surface is Needed to Beat Single Polarization?

Figure 2 for RIS-Aided Dual-Polarized MIMO: How Large a Surface is Needed to Beat Single Polarization?

Figure 3 for RIS-Aided Dual-Polarized MIMO: How Large a Surface is Needed to Beat Single Polarization?

Figure 4 for RIS-Aided Dual-Polarized MIMO: How Large a Surface is Needed to Beat Single Polarization?

Abstract:Dual-polarized (DP) multiple-input-multiple-output (MIMO) systems have been widely adopted in commercial mobile wireless communications. Such systems achieve multiplexing and diversity gain by exploiting the polarization dimension. However, existing studies have shown that the capacity of DP MIMO may not surpass that of single-polarized (SP) MIMO systems due to the cross-polarization coupling induced by the propagation environment. In this letter, we employ reconfigurable intelligent surfaces (RISs) to address this issue and investigate how large the surface should be to ensure a better performance for DP MIMO. Specifically, we first derive the capacities of DP and SP MIMO systems with an RIS, and then study the influence of the RIS size on the system capacity. Our analyses reveal how to deploy the RIS in a DP MIMO scenario.

Via

Access Paper or Ask Questions

Degrees of Freedom of Holographic MIMO in Multi-user Near-field Channels

Oct 07, 2024

Houfeng Chen, Shaohua Yue, Marco Di Renzo, Hongliang Zhang

Figure 1 for Degrees of Freedom of Holographic MIMO in Multi-user Near-field Channels

Figure 2 for Degrees of Freedom of Holographic MIMO in Multi-user Near-field Channels

Figure 3 for Degrees of Freedom of Holographic MIMO in Multi-user Near-field Channels

Abstract:Holographic multiple-input multiple-output (HMIMO) is an emerging technology for 6G communications, in which numerous antenna units are integrated in a limited space. As the HMIMO array aperture expands, the near-field region of the array is dramatically enlarged, resulting in more users being located in the near-field region. This creates new opportunities for wireless communications. In this context, the evaluation of the spatial degrees of freedom (DoF) of HMIMO multi-user systems in near-field channels is an open problem, as the methods of analysis utilized for evaluating the DoF in far-field channels cannnot be directly applied due to the different propagation characteristics. In this paper, we propose a novel method to calculate the DoF of HMIMO in multi-user near-field channels. We first derive the DoF for a single user in the near field, and then extend the analysis to multi-user scenarios. In this latter scenario, we focus on the impact of spatial blocking between HMIMO users. The derived analytical framework reveals that the DoF of HMIMO in multi-user near-field channels is not in general given by the sum of the DoF of the HMIMO single-user setting. Simulation results demonstrate that the proposed method can accurately estimate the DoF in HMIMO multi-user near-field channels in the presence of spatial blocking.

* 5pages,5figures

Via

Access Paper or Ask Questions