Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Cunhua Pan

Fellow, IEEE

Selective Depthwise Separable Convolution for Lightweight Joint Source-Channel Coding in Wireless Image Transmission

Apr 24, 2026

Ming Ye, Kui Cai, Cunhua Pan, Zhen Mei, Wanting Yang, Chunguo Li

Abstract:Depthwise separable convolutional (DSConv) layers have been successfully applied to deep learning (DL)-based joint source-channel coding (JSCC) schemes to reduce computational complexity. However, a systematic investigation of the layerwise and ratio-wise replacement of standard convolutional (Conv) layers with DSConv layers in JSCC systems for wireless image transmission remains largely unexplored. In this letter, we propose a configurable lightweight JSCC framework that incorporates a selective replacement strategy, enabling flexible substitution of standard Conv layers with DSConv layers at various layer positions and replacement ratios. By adjusting the proportion of layers replaced, we achieve different model compression levels and analyze their impact on reconstruction performance. Furthermore, we investigate how replacements at different encoder and decoder depths influence reconstruction quality under a fixed replacement ratio. Our results show that Conv-to-DSConv replacement at intermediate layers achieves a favorable complexity-performance trade-off, revealing layer-wise redundancy in DL-based JSCC systems. Extensive experiments further demonstrate that the proposed framework achieves substantial parameter reduction with only slight performance degradation, enabling flexible complexity-performance trade-offs for resource-constrained edge devices.

* 5 pages, 6 figures, journal

Via

Access Paper or Ask Questions

Spherical Antenna Arrays for Future Communications: Principles, Applications, and Research Directions

Apr 08, 2026

Cunhua Pan, Xianzhe Chen, Hong Ren, Jiangzhou Wang

Abstract:With the development of 6G technologies, traditional uniform linear arrays (ULAs) and uniform planar arrays (UPAs) can hardly meet the demands of three-dimensional (3D) full-space coverage and high angular resolution. Spherical antenna arrays (SAAs), with elements uniformly distributed on a spherical surface, provide an effective solution. This article analyzes the issues of traditional arrays, summarizes the advantages and typical structures of SAAs, discusses their potential application scenarios, and verifies their superiority over UPAs via a case study. Finally, key technical challenges and corresponding research directions of SAAs are identified, providing a reference for their research and application in future wireless communications.

Via

Access Paper or Ask Questions

A BEV-Fusion Based Framework for Sequential Multi-Modal Beam Prediction in mmWave Systems

Apr 07, 2026

Jiaming Zeng, Cunhua Pan, Haoyang Weng, Ruijing Liu, Hong Ren, Jiangzhou Wang

Abstract:Beam prediction is critical for reducing beam-training overhead in millimeter-wave (mmWave) systems, especially in high-mobility vehicular scenarios. This paper presents a BEV-Fusion based framework that unifies camera, LiDAR, radar, and GPS modalities in a shared bird's-eye-view (BEV) representation for spatially consistent multi-modal fusion. Unlike priorapproaches that fuse globally pooled one-dimensional features, the proposed method performs fusion in BEV space to preservecross-modal geometric structure and visual semantic density. A learned camera-to-BEV module based on cross-attention is adopted to generate BEV-aligned visual features without relying on precise camera calibration, and a temporal transformer is used to aggregate five-step sequential observations for motion-aware beam prediction. Experiments on the DeepSense 6G benchmark show that BEV-Fusion achieves approximately 87% distance- based accuracy (DBA) on scenarios 32, 33 and 34, outperforming the TransFuser baseline. These results indicate that BEV-space fusion provides an effective spatial abstraction for sensing-assisted beam prediction.

* 13pages,7figures

Via

Access Paper or Ask Questions

Ground Reflection-Aided TomoSAR Imaging with 5G NR Signals

Apr 03, 2026

Qiuyuan Yang, Cunhua Pan, Hong Ren, Jiangzhou Wang

Abstract:Tomographic synthetic aperture radar (TomoSAR) enables three-dimensional imaging by resolving targets along the elevation dimension, which is essential for environment reconstruction and infrastructure monitoring. A critical challenge in TomoSAR is the severe multipath propagation that causes ghost targets, range offsets, and elevation ambiguities. To address this, this paper proposes an enhanced Newtonized orthogonal matching pursuit (NOMP) algorithm to extract the delay, Doppler, and complex amplitude parameters of each propagation path, effectively separating line-of-sight (LoS) and multipath components prior to TomoSAR processing. Additionally, a height fusion strategy combining TomoSAR estimates with LoS-ground reflection delay-based inversion improves elevation accuracy. Simulation results demonstrate that the proposed method achieves improved positioning and elevation accuracy while effectively suppressing multipath-induced artifacts.

Via

Access Paper or Ask Questions

Spherical Antenna Arrays: Principles, Applications, and Research Directions

Apr 03, 2026

Cunhua Pan, Xianzhe Chen, Hong Ren, Jiangzhou Wang

Via

Access Paper or Ask Questions

Fairness-Aware Beamforming for Polarimetric ISAC Systems with Polarization-Reconfigurable Antennas

Mar 18, 2026

Weijie Xiong, Jingran Lin, Di Jiang, Cunhua Pan, Hongli Liu, Kai Zhong, Qiang Li

Abstract:Polarization diversity offers significant flexibility for enhancing integrated sensing and communications (ISAC). However, conventional dual-polarized arrays typically require dedicated radio-frequency (RF) chains for each polarization branch, leading to prohibitive hardware costs. To address this, polarization-reconfigurable (PR) antennas have emerged as a cost-effective alternative, enabling polarization flexibility with reduced hardware complexity by driving two polarization branches with a single RF chain. In this paper, we investigate fairness-aware beamforming for ISAC systems equipped with PR antennas. Specifically, we jointly optimize the transmit beamforming and PR control coefficients to maximize the minimum signal-to-interference-plus-noise ratio (SINR) for communication users and the minimum signal-to-clutter-plus-noise ratio (SCNR) for sensing targets. The resulting problem is highly nonconvex and nonsmooth due to the strong coupling among optimization variables in the max-min objective, as well as the nonconvex spherical constraints imposed by the PR antennas. To tackle this, we derive an equivalent smooth reformulation by introducing auxiliary variables and transforming the minimum operators into inequality constraints. Subsequently, we develop an exact-penalty product Riemannian manifold gradient descent (EP-PRMGD) algorithm, which integrates an exact penalty method with Riemannian optimization to guarantee convergence to a Karush-Kuhn-Tucker (KKT) point. Numerical results demonstrate that the proposed PR-enabled ISAC scheme achieves performance comparable to dual-polarized architectures while utilizing only half the RF chains, thereby validating its effectiveness in balancing fairness and hardware efficiency.

Via

Access Paper or Ask Questions

Subspace Fusion Sensing for Cooperative ISAC

Mar 09, 2026

Yining Xu, Cunhua Pan, Jun Tang, Hong Ren, Jiangzhou Wang

Abstract:This paper proposes a subspace fusion sensing algorithm for cooperative integrated sensing and communication. First, we stack the received signals from access points (APs) into a third-order tensor and construct the equivalent virtual antenna (EVA) array via tensor unfolding. Then, a data association-free subspace-based fusion sensing algorithm is developed utilizing the EVA arrays from distributed APs. A derivation of Cramer-Rao lower bound (CRLB) is also presented. Finally, simulation results validate the effectiveness of the proposed algorithm compared to traditional techniques.

* This work has been submitted to the IEEE for possible publication

Via

Access Paper or Ask Questions

TokenCom: Vision-Language Model for Multimodal and Multitask Token Communications

Feb 28, 2026

Feibo Jiang, Siwei Tu, Li Dong, Xiaolong Li, Kezhi Wang, Cunhua Pan, Zhu Han, Jiangzhou Wang

Abstract:Visual-Language Models (VLMs), with their strong capabilities in image and text understanding, offer a solid foundation for intelligent communications. However, their effectiveness is constrained by limited token granularity, overlong visual token sequences, and inadequate cross-modal alignment. To overcome these challenges, we propose TaiChi, a novel VLM framework designed for token communications. TaiChi adopts a dual-visual tokenizer architecture that processes both high- and low-resolution images to collaboratively capture pixel-level details and global conceptual features. A Bilateral Attention Network (BAN) is introduced to intelligently fuse multi-scale visual tokens, thereby enhancing visual understanding and producing compact visual tokens. In addition, a Kolmogorov Arnold Network (KAN)-based modality projector with learnable activation functions is employed to achieve precise nonlinear alignment from visual features to the text semantic space, thus minimizing information loss. Finally, TaiChi is integrated into a multimodal and multitask token communication system equipped with a joint VLM-channel coding scheme. Experimental results validate the superior performance of TaiChi, as well as the feasibility and effectiveness of the TaiChi-driven token communication system.

Via

Access Paper or Ask Questions

U-Net-Based Generative Joint Source-Channel Coding for Wireless Image Transmission

Feb 26, 2026

Ming Ye, Kui Cai, Cunhua Pan, Zhen Mei, Wanting Yang, Chunguo Li

Abstract:Deep learning (DL)-based joint source-channel coding (JSCC) methods have achieved remarkable success in wireless image transmission. However, these methods either focus on conventional distortion metrics that do not necessarily yield high perceptual quality or incur high computational complexity. In this paper, we propose two DL-based JSCC (DeepJSCC) methods that leverage deep generative architectures for wireless image transmission. Specifically, we propose G-UNet-JSCC, a scheme comprising an encoder and a U-Net-based generator serving as the decoder. Its skip connections enable multi-scale feature fusion to improve both pixel-level fidelity and perceptual quality of reconstructed images by integrating low- and high-level features. To further enhance pixel-level fidelity, the encoder and the U-Net-based decoder are jointly optimized using a weighted sum of structural similarity and mean-squared error (MSE) losses. Building upon G-UNet-JSCC, we further develop a DeepJSCC method called cGAN-JSCC, where the decoder is enhanced through adversarial training. In this scheme, we retain the encoder of G-UNet-JSCC and adversarially train the decoder's generator against a patch-based discriminator. cGAN-JSCC employs a two-stage training procedure. The outer stage trains the encoder and the decoder end-to-end using an MSE loss, while the inner stage adversarially trains the decoder's generator and the discriminator by minimizing a joint loss combining adversarial and distortion losses. Simulation results demonstrate that the proposed methods achieve superior pixel-level fidelity and perceptual quality on both high- and low-resolution images. For low-resolution images, cGAN-JSCC achieves better reconstruction performance and greater robustness to channel variations than G-UNet-JSCC.

* This work has been submitted to the IEEE for possible publication

Via

Access Paper or Ask Questions

Theoretical and Empirical Study of Spatial Power Focusing Effect for Sparse Arrays at Terahertz Band

Nov 19, 2025

Yongchao He, Taihao Zhang, Cunhua Pan, Hong Ren, Xianzhe Chen, Tian Qiu, Bingchang Hua, Jiangzhou Wang

Abstract:This work investigates the spatial power focusing effect for large-scale sparse arrays at terahertz (THz) band, combining theoretical analysis with experimental validation. Specifically, based on a Green's function channel model, we analyze the power distribution along the $z$-axis, deriving a closed-form expression to characterize the focusing effect. Furthermore, the factors influencing the focusing effect, including phase noise and positional deviations, are theoretically analyzed and numerically simulated. Finally, a 300 GHz measurement platform based on a vector network analyzer (VNA) is constructed for experimental validation. The measurement results demonstrate close consistence with theoretical simulation results, confirming the spatial power focusing effect for sparse arrays.

* 5 pages, 7 figures

Via

Access Paper or Ask Questions