Abstract:Typical reconfigurable intelligent surface (RIS) implementations include metasurfaces with almost passive unit elements capable of reflecting their incident waves in controllable ways, enhancing wireless communications in a cost-effective manner. In this paper, we advance the concept of intelligent metasurfaces by introducing a flexible array geometry, termed flexible intelligent metasurface (FIM), which supports both element movement (EM) and passive beamforming (PBF). In particular, based on the single-input single-output (SISO) system setup, we first compare three modes of FIM, namely, EM-only, PBF-only, and EM-PBF, in terms of received signal power under different FIM and channel setups. The PBF-only mode, which only adjusts the reflect phase, is shown to be less effective than the EM-only mode in enhancing received signal strength. In a multi-element, multi-path scenario, the EM-only mode improves the received signal power by 125% compared to the PBF-only mode. The EM-PBF mode, which optimizes both element positions and phases, further enhances performance. Additionally, we investigate the channel estimation problem for FIM systems by designing a protocol that gathers EM and PBF measurements, enabling the formulation of a compressive sensing problem for joint cascaded and direct channel estimation. We then propose a sparse recovery algorithm called clustering mean-field variational sparse Bayesian learning, which enhances estimation performance while maintaining low complexity.
Abstract:We propose an efficient multi-view stereo (MVS) network for infering depth value from multiple RGB images. Recent studies have shown that mapping the geometric relationship in real space to neural network is an essential topic of the MVS problem. Specifically, these methods focus on how to express the correspondence between different views by constructing a nice cost volume. In this paper, we propose a more complete cost volume construction approach based on absorbing previous experience. First of all, we introduce the self-attention mechanism to fully aggregate the dominant information from input images and accurately model the long-range dependency, so as to selectively aggregate reference features. Secondly, we introduce the group-wise correlation to feature aggregation, which greatly reduces the memory and calculation burden. Meanwhile, this method enhances the information interaction between different feature channels. With this approach, a more lightweight and efficient cost volume is constructed. Finally we follow the coarse to fine strategy and refine the depth sampling range scale by scale with the help of uncertainty estimation. We further combine the previous steps to get the attention thin volume. Quantitative and qualitative experiments are presented to demonstrate the performance of our model.