Fellow, IEEE
Abstract:With the development of 6G technologies, traditional uniform linear arrays (ULAs) and uniform planar arrays (UPAs) can hardly meet the demands of three-dimensional (3D) full-space coverage and high angular resolution. Spherical antenna arrays (SAAs), with elements uniformly distributed on a spherical surface, provide an effective solution. This article analyzes the issues of traditional arrays, summarizes the advantages and typical structures of SAAs, discusses their potential application scenarios, and verifies their superiority over UPAs via a case study. Finally, key technical challenges and corresponding research directions of SAAs are identified, providing a reference for their research and application in future wireless communications.
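As an illustration of placing elements on a spherical surface, the sketch below generates an approximately uniform layout and its far-field steering vector. It is a minimal sketch under stated assumptions, not the article's design: the Fibonacci lattice is one common near-uniform placement chosen here for simplicity, the elements are assumed isotropic, and the names `fibonacci_sphere` and `steering_vector` are hypothetical.

```python
import numpy as np

def fibonacci_sphere(num_elements: int, radius: float = 1.0) -> np.ndarray:
    """Return (num_elements, 3) points spread near-uniformly on a sphere.

    Assumption: a Fibonacci lattice stands in for the 'uniform' placement;
    the article may use a different layout.
    """
    golden_angle = np.pi * (3.0 - np.sqrt(5.0))
    k = np.arange(num_elements)
    z = 1.0 - (2.0 * k + 1.0) / num_elements      # evenly spaced heights in (-1, 1)
    rho = np.sqrt(1.0 - z**2)                     # ring radius at each height
    phi = golden_angle * k                        # azimuth advances by the golden angle
    unit = np.stack([rho * np.cos(phi), rho * np.sin(phi), z], axis=1)
    return radius * unit

def steering_vector(positions: np.ndarray, azimuth: float,
                    elevation: float, wavelength: float) -> np.ndarray:
    """Far-field narrowband response of isotropic elements (an assumption)."""
    direction = np.array([
        np.cos(elevation) * np.cos(azimuth),
        np.cos(elevation) * np.sin(azimuth),
        np.sin(elevation),
    ])
    phase = 2.0 * np.pi / wavelength * positions @ direction
    return np.exp(1j * phase)
```

Because the element positions cover the full sphere, the same function evaluates the response for any azimuth and elevation, which is the geometric property that planar arrays lack.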
Abstract:Beam prediction is critical for reducing beam-training overhead in millimeter-wave (mmWave) systems, especially in high-mobility vehicular scenarios. This paper presents a BEV-Fusion-based framework that unifies camera, LiDAR, radar, and GPS modalities in a shared bird's-eye-view (BEV) representation for spatially consistent multi-modal fusion. Unlike prior approaches that fuse globally pooled one-dimensional features, the proposed method performs fusion in BEV space to preserve cross-modal geometric structure and visual semantic density. A learned camera-to-BEV module based on cross-attention is adopted to generate BEV-aligned visual features without relying on precise camera calibration, and a temporal transformer is used to aggregate five-step sequential observations for motion-aware beam prediction. Experiments on the DeepSense 6G benchmark show that BEV-Fusion achieves approximately 87% distance-based accuracy (DBA) on scenarios 32, 33, and 34, outperforming the TransFuser baseline. These results indicate that BEV-space fusion provides an effective spatial abstraction for sensing-assisted beam prediction.
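The DBA metric quoted above can be sketched as follows. This is an assumption-laden sketch: the abstract does not define DBA, so the formula here follows the definition commonly associated with the DeepSense 6G beam prediction challenge (average over top-1 to top-3 of one minus the clipped, normalized beam-index distance, with a normalization of 5 beam indices). The function name `dba_score` and its defaults are hypothetical.

```python
import numpy as np

def dba_score(top_k_preds, labels, delta: float = 5.0, max_k: int = 3) -> float:
    """Distance-based accuracy over top-1..top-max_k predictions.

    Assumed definition (not stated in the abstract):
      Y_K = 1 - mean_n( min_{k<=K} min(|pred[n, k] - label[n]| / delta, 1) )
      DBA = mean(Y_1, ..., Y_max_k)
    """
    preds = np.asarray(top_k_preds, dtype=float)      # shape (N, >= max_k)
    labels = np.asarray(labels, dtype=float)          # shape (N,)
    # Normalized beam-index distance, clipped at 1 so far-off beams cost the same.
    dist = np.minimum(np.abs(preds - labels[:, None]) / delta, 1.0)
    # For each K, keep the best of the first K candidates per sample.
    per_k = [1.0 - dist[:, :k].min(axis=1).mean() for k in range(1, max_k + 1)]
    return float(np.mean(per_k))
```

Unlike plain top-1 accuracy, this score gives partial credit to a prediction that lands on a neighboring beam index.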
Abstract:Tomographic synthetic aperture radar (TomoSAR) enables three-dimensional imaging by resolving targets along the elevation dimension, which is essential for environment reconstruction and infrastructure monitoring. A critical challenge in TomoSAR is the severe multipath propagation that causes ghost targets, range offsets, and elevation ambiguities. To address this, this paper proposes an enhanced Newtonized orthogonal matching pursuit (NOMP) algorithm to extract the delay, Doppler, and complex amplitude parameters of each propagation path, effectively separating line-of-sight (LoS) and multipath components prior to TomoSAR processing. Additionally, a height fusion strategy combining TomoSAR estimates with LoS-ground reflection delay-based inversion improves elevation accuracy. Simulation results demonstrate that the proposed method achieves improved positioning and elevation accuracy while effectively suppressing multipath-induced artifacts.
Abstract:Due to the directive property of each antenna element, the received signal power can be severely attenuated when the emitter deviates from the array boresight, which severely degrades sensing performance along the corresponding direction. Although existing rotatable-array sensing methods such as recursive-rotation Root-MUSIC (RR-Root-MUSIC) can mitigate this issue by iteratively rotating and sensing, they require several mechanical rotations and repeated eigendecompositions, resulting in high computational complexity and low time efficiency. To address this problem, a pre-rotation initialization that uses receive power as the selection criterion is proposed to significantly reduce the computational complexity and improve the time efficiency. Using this idea, a low-complexity enhanced direction-sensing framework with pre-rotation initialization and iterative greedy spatial-spectrum search (PRI-IGSS) is developed in three stages: (1) the normal vector of the array is rotated to a set of candidate directions to find the optimal direction with the maximum sensing energy, and the corresponding direction-of-arrival (DOA) value is computed by the Root-MUSIC algorithm; (2) the array is mechanically rotated to the initial estimated direction and kept fixed; (3) an iterative greedy spatial-spectrum search or receive beamforming method, motivated by reinforcement learning, is designed with a reduced search range, and the sum of all previous sample covariance matrices and the current one is used to provide an increasing performance gain as the iterations continue. To assess the performance of the proposed method, the corresponding Cramér-Rao lower bound (CRLB) is derived with a simplified rotation model. Simulation results demonstrate that the proposed PRI-IGSS method performs much better than RR-Root-MUSIC, which has no sample accumulation, and achieves the CRLB in terms of mean squared error.
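The covariance accumulation in stage (3) and the Root-MUSIC step can be sketched as follows. This is a generic illustration under stated assumptions, not the PRI-IGSS method itself: it assumes a half-wavelength uniform linear array with isotropic elements, omits the rotation model and the greedy search, and uses the hypothetical names `sample_covariance` and `root_music`.

```python
import numpy as np

def sample_covariance(snapshots: np.ndarray) -> np.ndarray:
    """Sample covariance of an (antennas, snapshots) data matrix."""
    return snapshots @ snapshots.conj().T / snapshots.shape[1]

def root_music(R: np.ndarray, num_sources: int, spacing: float = 0.5) -> np.ndarray:
    """Root-MUSIC DOA estimates (radians) for a ULA; spacing is in wavelengths.

    Assumes the steering vector a(theta)[m] = exp(j*2*pi*spacing*m*sin(theta)).
    """
    M = R.shape[0]
    _, eigvecs = np.linalg.eigh(R)              # eigenvalues in ascending order
    En = eigvecs[:, :M - num_sources]           # noise subspace
    C = En @ En.conj().T
    # a^H C a = sum_k trace(C, k) z^k with z = exp(j*omega); highest power first.
    coeffs = np.array([np.trace(C, offset=k) for k in range(M - 1, -M, -1)])
    roots = np.roots(coeffs)
    roots = roots[np.abs(roots) < 1.0]          # keep one root of each reciprocal pair
    roots = roots[np.argsort(1.0 - np.abs(roots))][:num_sources]  # closest to unit circle
    sin_theta = np.clip(np.angle(roots) / (2.0 * np.pi * spacing), -1.0, 1.0)
    return np.sort(np.arcsin(sin_theta))
```

With the array held fixed, summing the sample covariance matrices from successive iterations before calling `root_music` acts like using more snapshots, which is one way to read the increasing gain described in the abstract.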
Abstract:This paper investigates secure Directional Modulation (DM) design enhanced by a rotatable active Reconfigurable Intelligent Surface (RIS). In conventional RIS-assisted DM networks, the security performance gain is limited due to the multiplicative path loss introduced by the RIS reflection path. To address this challenge, a Secrecy Rate (SR) maximization problem is formulated, subject to constraints including the eavesdropper's Direction Of Arrival (DOA) estimation performance, transmit power, rotatable range, and maximum reflection amplitude of the RIS elements. To solve this non-convex optimization problem, three algorithms are proposed: a multi-stream null-space projection and leakage-based method, an enhanced leakage-based method, and an optimization scheme based on the Distributed Soft Actor-Critic with Three refinements (DSAC-T). Simulation results validate the effectiveness of the proposed algorithms. A performance trade-off is observed between the eavesdropper's DOA estimation accuracy and the achievable SR. The security enhancement provided by the RIS is more significant in systems equipped with a small number of antennas. By optimizing the orientation of the RIS, a 52.6% improvement in SR performance can be achieved.
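For reference, the quantity being maximized can be sketched with the standard secrecy-rate definition: the rate at the legitimate user minus the rate at the eavesdropper, floored at zero. This is an assumption, since the abstract does not give the paper's exact SR expression; the function name `secrecy_rate` and the SINR inputs are hypothetical.

```python
import numpy as np

def secrecy_rate(sinr_user, sinr_eve):
    """Standard secrecy rate in bit/s/Hz: [log2(1+SINR_user) - log2(1+SINR_eve)]^+.

    Assumption: textbook definition; the paper's expression may differ.
    Inputs are linear-scale SINRs (scalars or arrays).
    """
    gap = np.log2(1.0 + np.asarray(sinr_user)) - np.log2(1.0 + np.asarray(sinr_eve))
    return np.maximum(0.0, gap)
```

For example, a user SINR of 15 and an eavesdropper SINR of 3 give log2(16) - log2(4) = 2 bit/s/Hz, while any case where the eavesdropper has the better SINR gives zero.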
Abstract:This paper proposes a subspace fusion sensing algorithm for cooperative integrated sensing and communication. First, we stack the received signals from access points (APs) into a third-order tensor and construct the equivalent virtual antenna (EVA) array via tensor unfolding. Then, a data association-free subspace-based fusion sensing algorithm is developed utilizing the EVA arrays from distributed APs. The Cramér-Rao lower bound (CRLB) is also derived. Finally, simulation results validate the effectiveness of the proposed algorithm compared to traditional techniques.
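The tensor-unfolding step can be illustrated with a small sketch. The tensor dimensions are assumptions, since the abstract does not state them: here a rank-one tensor with a hypothetical layout (antennas, subcarriers, snapshots) is unfolded so that the first two modes merge into one longer virtual array. The helper name `unfold` is hypothetical, and its column ordering follows NumPy's row-major reshape rather than any particular textbook convention.

```python
import numpy as np

def unfold(tensor: np.ndarray, mode: int) -> np.ndarray:
    """Mode-`mode` unfolding: bring that axis to the front, flatten the rest.

    Note: remaining axes are flattened in row-major (C) order, which is an
    implementation choice and differs from some textbook conventions.
    """
    return np.moveaxis(tensor, mode, 0).reshape(tensor.shape[mode], -1)

def virtual_array_data(tensor: np.ndarray) -> np.ndarray:
    """Merge the first two modes into one 'virtual array' axis.

    For a tensor with entries a[m] * b[n] * s[t], the result is the
    (M*N, T) matrix kron(a, b) s^T, i.e. a longer array with steering
    vector kron(a, b). The axis layout is an assumed example.
    """
    return unfold(tensor, 2).T
```

The enlarged aperture of the virtual array is what a subspace method can then exploit; how the paper fuses the arrays from several APs is not covered by this sketch.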
Abstract:In this paper, we consider a synthetic aperture secure beamforming approach for a virtual multiple-input multiple-output (MIMO) broadcast channel in the presence of hybrid wiretapping environments. Our goal is to design the flight node deployment constructed by a single-antenna mobile autonomous aerial vehicle (AAV), the corresponding transmission symbol strategy, the transmit precoding, and the receive beamforming to maximize the system channel capacity. Leveraging synthetic aperture beamforming, we aim to provide spatial gain along a predefined angle in free space while reducing it in others, and thus enhance physical layer (PHY) security. To this end, we analyze the expression of the asymptotic channel eigenvalues to optimize the AAV flight node deployment. For the optimal precoding design, an energy-efficient method that minimizes the transmit power consumption is studied based on the given virtual MIMO channel, while meeting the quality of service (QoS) for the base station (BS), the leakage tolerance of eavesdroppers (Eves), and per-node power constraints. The power minimization problem is a non-convex program, which is then reformulated into a tractable form after some mathematical manipulations. Moreover, we design the receive beamforming by applying the linearly constrained minimum variance (LCMV) method such that the jamming can be effectively suppressed. Numerical results demonstrate the superiority of the proposed method in promoting capacity.
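The LCMV receive beamformer mentioned above has a well-known closed form, sketched below. The setup is an assumption for illustration only: a half-wavelength uniform linear array stands in for the paper's virtual MIMO channel, and the names `lcmv_weights` and `ula_steering` are hypothetical.

```python
import numpy as np

def lcmv_weights(R: np.ndarray, C: np.ndarray, f: np.ndarray) -> np.ndarray:
    """Minimize w^H R w subject to C^H w = f.

    Closed form: w = R^{-1} C (C^H R^{-1} C)^{-1} f, computed with linear
    solves instead of explicit inverses.
    """
    R_inv_C = np.linalg.solve(R, C)
    gram = C.conj().T @ R_inv_C
    return R_inv_C @ np.linalg.solve(gram, f)

def ula_steering(num_antennas: int, angle_rad: float, spacing: float = 0.5) -> np.ndarray:
    """Steering vector of a ULA with spacing in wavelengths (assumed geometry)."""
    n = np.arange(num_antennas)
    return np.exp(1j * 2.0 * np.pi * spacing * n * np.sin(angle_rad))
```

A typical use stacks the desired-signal and jammer steering vectors as the columns of `C` and sets `f = [1, 0]`, which keeps unit gain toward the desired direction and places a null on the jammer.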
Abstract:Visual-Language Models (VLMs), with their strong capabilities in image and text understanding, offer a solid foundation for intelligent communications. However, their effectiveness is constrained by limited token granularity, overlong visual token sequences, and inadequate cross-modal alignment. To overcome these challenges, we propose TaiChi, a novel VLM framework designed for token communications. TaiChi adopts a dual-visual tokenizer architecture that processes both high- and low-resolution images to collaboratively capture pixel-level details and global conceptual features. A Bilateral Attention Network (BAN) is introduced to fuse multi-scale visual tokens, thereby enhancing visual understanding and producing compact visual tokens. In addition, a Kolmogorov-Arnold Network (KAN)-based modality projector with learnable activation functions is employed to achieve precise nonlinear alignment from visual features to the text semantic space, thus minimizing information loss. Finally, TaiChi is integrated into a multimodal and multitask token communication system equipped with a joint VLM-channel coding scheme. Experimental results validate the superior performance of TaiChi, as well as the feasibility and effectiveness of the TaiChi-driven token communication system.
Abstract:Accurate channel state information (CSI) is vital for multiple-input multiple-output (MIMO) systems. However, superimposed pilots (SIP), which reduce overhead, introduce severe pilot contamination and data interference, complicating joint channel estimation and data detection. This paper proposes a conditional flow matching receiver (CFM-Rx), an unsupervised generative framework that learns directly from received signals, eliminating the need for labeled data and improving adaptability across diverse system settings. By leveraging flow-based generative modeling, CFM-Rx enables deterministic, low-latency inference and exploits model invertibility to capture the bidirectional nature of signal propagation. This framework unifies flow matching with score-based diffusion modeling via a moment-consistent ordinary differential equation (ODE), replacing stochastic differential equation (SDE) sampling with a deterministic and efficient process. Furthermore, it integrates receiver-side priors to ensure stable, data-consistent inference. Extensive simulation results across various MIMO configurations demonstrate that CFM-Rx consistently outperforms conventional estimators and state-of-the-art data-driven receivers, achieving notable gains in channel estimation accuracy and symbol detection robustness, particularly under severe pilot contamination.