Quan Zhou

From Minimax Optimal Importance Sampling to Uniformly Ergodic Importance-tempered MCMC

Jun 23, 2025

Quan Zhou

Abstract: We make two closely related theoretical contributions to the use of importance sampling schemes. First, for independent sampling, we prove that the minimax optimal trial distribution coincides with the target if and only if the target distribution has no atom with probability greater than $1/2$, where "minimax" means that the worst-case asymptotic variance of the self-normalized importance sampling estimator is minimized. When a large atom exists, it should be downweighted by the trial distribution. A similar phenomenon holds for a continuous target distribution concentrated on a small set. Second, we argue that it is often advantageous to run the Metropolis--Hastings algorithm with a tempered stationary distribution, $\pi(x)^\beta$, and correct for the bias by importance weighting. The dynamics of this "importance-tempered" sampling scheme can be described by a continuous-time Markov chain. We prove that for one-dimensional targets with polynomial tails, $\pi(x) \propto (1 + |x|)^{-\gamma}$, this chain is uniformly ergodic if and only if $1/\gamma < \beta < (\gamma - 2)/\gamma$. These results suggest that for target distributions with light or polynomial tails of order $\gamma > 3$, importance tempering can improve the precision of time-average estimators and essentially eliminate the need for burn-in.
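The condition in this abstract can be checked numerically. The Python sketch below computes the admissible range of $\beta$ for a given tail order $\gamma$ (the open interval $1/\gamma < \beta < (\gamma - 2)/\gamma$ is nonempty exactly when $\gamma > 3$), and evaluates a plain self-normalized importance sampling estimate in which samples from a density proportional to $\pi(x)^\beta$ are reweighted by $\pi(x)^{1-\beta}$. The function names are hypothetical, and the estimator is the textbook importance-weighting correction, not the continuous-time scheme analyzed in the paper.

```python
import math

def beta_interval(gamma):
    """Return (lower, upper) from 1/gamma < beta < (gamma - 2)/gamma,
    or None when the open interval is empty (gamma <= 3)."""
    lower, upper = 1.0 / gamma, (gamma - 2.0) / gamma
    return (lower, upper) if lower < upper else None

def snis_estimate(xs, f, log_target, beta):
    """Self-normalized importance sampling estimate of E_pi[f].

    Assumes xs were drawn from a density proportional to pi(x)**beta,
    so the unnormalized weight of x is pi(x)**(1 - beta).
    """
    log_w = [(1.0 - beta) * log_target(x) for x in xs]
    shift = max(log_w)  # subtract the max before exponentiating, for stability
    w = [math.exp(lw - shift) for lw in log_w]
    return sum(wi * f(x) for wi, x in zip(w, xs)) / sum(w)
```

For example, `beta_interval(4.0)` gives the range between 0.25 and 0.5, while `beta_interval(3.0)` returns `None` because the two bounds coincide.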

Representative Action Selection for Large Action-Space Meta-Bandits

May 23, 2025

Quan Zhou, Mark Kozdoba, Shie Mannor

Abstract: We study the problem of selecting a subset from a large action space shared by a family of bandits, with the goal of achieving performance nearly matching that of using the full action space. We assume that similar actions tend to have related payoffs, modeled by a Gaussian process. To exploit this structure, we propose a simple epsilon-net algorithm to select a representative subset. We provide theoretical guarantees for its performance and compare it empirically to Thompson Sampling and Upper Confidence Bound.
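To illustrate what an epsilon-net over an action space looks like, here is a minimal Python sketch of a generic greedy construction. The abstract does not specify the paper's construction or its distance, so the Euclidean metric, the greedy order, and the name `greedy_epsilon_net` are all assumptions made for illustration.

```python
import math

def greedy_epsilon_net(actions, eps):
    """Return indices of chosen actions such that every action lies
    within distance eps of at least one chosen action."""
    chosen = []
    for i, a in enumerate(actions):
        # Keep an action only if it is farther than eps from every pick so far.
        if all(math.dist(a, actions[j]) > eps for j in chosen):
            chosen.append(i)
    return chosen

# Five one-dimensional actions forming two clusters.
actions = [(0.0,), (0.1,), (0.2,), (1.0,), (1.1,)]
net = greedy_epsilon_net(actions, eps=0.5)
```

Here `net` holds one representative per cluster. A bandit algorithm could then be run on the chosen subset instead of the full action space.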

Restricted Spectral Gap Decomposition for Simulated Tempering Targeting Mixture Distributions

May 21, 2025

Jhanvi Garg, Krishna Balasubramanian, Quan Zhou

Abstract: Simulated tempering is a widely used strategy for sampling from multimodal distributions. In this paper, we consider simulated tempering combined with an arbitrary local Markov chain Monte Carlo sampler and present a new decomposition theorem that provides a lower bound on the restricted spectral gap of the algorithm for sampling from mixture distributions. By working with the restricted spectral gap, the applicability of our results is extended to broader settings such as when the usual spectral gap is difficult to bound or becomes degenerate. We demonstrate the application of our theoretical results by analyzing simulated tempering combined with random walk Metropolis--Hastings for sampling from mixtures of Gaussian distributions. We show that in fixed-dimensional settings, the algorithm's complexity scales polynomially with the separation between modes and logarithmically with $1/\varepsilon$, where $\varepsilon$ is the target accuracy in total variation distance.
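For readers unfamiliar with the algorithm, the following Python sketch shows a textbook form of simulated tempering with a random walk Metropolis-Hastings local move on a two-component Gaussian mixture. It is not the paper's exact setup: the level weights are taken to be uniform (no normalizing-constant correction), and the target, the temperature ladder, and all function names are assumptions chosen for illustration.

```python
import math
import random

def log_mixture(x, mu=4.0):
    """Unnormalized log density of an equal mixture of N(-mu, 1) and N(mu, 1)."""
    a, b = -0.5 * (x + mu) ** 2, -0.5 * (x - mu) ** 2
    m = max(a, b)
    return m + math.log(math.exp(a - m) + math.exp(b - m))

def accept(rng, log_ratio):
    """Metropolis acceptance with probability min(1, exp(log_ratio))."""
    return rng.random() < math.exp(min(0.0, log_ratio))

def simulated_tempering(log_target, betas, n_steps, step=1.0, seed=0):
    """Return a list of (x, level) states, where level indexes into betas."""
    rng = random.Random(seed)
    x, k = 0.0, 0
    trace = []
    for _ in range(n_steps):
        # Local move: random walk Metropolis-Hastings targeting pi(x)**betas[k].
        y = x + rng.gauss(0.0, step)
        if accept(rng, betas[k] * (log_target(y) - log_target(x))):
            x = y
        # Level move: propose a neighbouring level; out-of-range proposals are rejected.
        j = k + rng.choice((-1, 1))
        if 0 <= j < len(betas) and accept(rng, (betas[j] - betas[k]) * log_target(x)):
            k = j
        trace.append((x, k))
    return trace

trace = simulated_tempering(log_mixture, betas=[0.1, 0.3, 1.0], n_steps=200)
```

States recorded at the last level, where the inverse temperature is 1, are the ones that target the original mixture. In practice the levels are usually reweighted so that the chain spends comparable time at each temperature.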

* 36 pages, 1 figure, 1 table

RIS-Assisted Beamfocusing in Near-Field IoT Communication Systems: A Transformer-Based Approach

Apr 17, 2025

Quan Zhou, Jingjing Zhao, Kaiquan Cai, Yanbo Zhu

Abstract: The massive number of antennas in extremely large aperture array (ELAA) systems shifts the propagation regime of signals in Internet of Things (IoT) communication systems towards near-field spherical wave propagation. We propose a reconfigurable intelligent surface (RIS)-assisted beamfocusing mechanism, in which designing the two-dimensional beam codebook that covers both the angular and distance domains is challenging. To address this issue, we introduce a novel Transformer-based two-stage beam training algorithm, which includes coarse and fine search phases. The proposed mechanism provides a fine-grained codebook with enhanced spatial resolution, enabling precise beamfocusing. Specifically, in the first stage, beam training is performed with a simple codebook to estimate the approximate location of the device and to determine whether it is within the beamfocusing range (BFR) or the non-beamfocusing range (NBFR). In the second stage, a fine-grained beam search is conducted with a more precise codebook. Experimental results show that the precision of the RIS-assisted beamfocusing is greatly improved. The proposed method achieves beam selection accuracy of up to 97% at a signal-to-noise ratio (SNR) of 20 dB, and improves on the baseline method by 10% to 50% at different SNRs.
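The coarse-then-fine idea can be sketched without any learning component. The Python example below runs an exhaustive two-stage search over a grid of (angle index, distance index) codewords, using a made-up gain function. It stands in for the general search pattern only: the paper's Transformer predictor, its codebooks, and the BFR/NBFR decision are not modeled, and every name here is hypothetical.

```python
def two_stage_search(gain, n_angle, n_dist, stride):
    """Coarse-to-fine search over an (angle index, distance index) codebook grid."""
    # Stage 1: evaluate only every `stride`-th codeword in each dimension.
    coarse = [(a, d)
              for a in range(0, n_angle, stride)
              for d in range(0, n_dist, stride)]
    a0, d0 = max(coarse, key=lambda c: gain(*c))
    # Stage 2: evaluate all codewords less than one stride away from the coarse winner.
    fine = [(a, d)
            for a in range(max(0, a0 - stride + 1), min(n_angle, a0 + stride))
            for d in range(max(0, d0 - stride + 1), min(n_dist, d0 + stride))]
    return max(fine, key=lambda c: gain(*c))

def toy_gain(a, d):
    """Hypothetical single-peaked beam gain with its maximum at codeword (9, 5)."""
    return -((a - 9) ** 2 + (d - 5) ** 2)

best = two_stage_search(toy_gain, n_angle=16, n_dist=16, stride=4)
```

With a 16 by 16 grid and a stride of 4, the search evaluates 16 coarse codewords and at most 49 fine ones instead of all 256. The shortcut is only reliable when the gain is well behaved around its peak, as in this toy case.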

TDFANet: Encoding Sequential 4D Radar Point Clouds Using Trajectory-Guided Deformable Feature Aggregation for Place Recognition

Apr 07, 2025

Shouyi Lu, Guirong Zhuo, Haitao Wang, Quan Zhou, Huanyu Zhou, Renbo Huang, Minqing Huang, Lianqing Zheng, Qiang Shu

Abstract: Place recognition is essential for achieving closed-loop or global positioning in autonomous vehicles and mobile robots. Despite recent advancements in place recognition using 2D cameras or 3D LiDAR, it remains to be seen how to use 4D radar, an increasingly popular sensor owing to its robustness against adverse weather and lighting conditions, for place recognition. Compared to LiDAR point clouds, radar data are drastically sparser, noisier and of much lower resolution, which hampers their ability to effectively represent scenes and poses significant challenges for 4D radar-based place recognition. This work addresses these challenges by leveraging multi-modal information from sequential 4D radar scans and effectively extracting and aggregating spatio-temporal features. Our approach follows a principled pipeline that comprises (1) dynamic point removal and ego-velocity estimation from the velocity property, (2) bird's eye view (BEV) feature encoding on the refined point cloud, (3) feature alignment using the BEV feature map motion trajectory calculated from the ego-velocity, and (4) extraction and aggregation of multi-scale spatio-temporal features from the aligned BEV feature maps. Real-world experimental results validate the feasibility of the proposed method and demonstrate its robustness in handling dynamic environments. Source code is available.

* 8 pages, 4 figures. Accepted to ICRA 2025

Movable-Element RIS-Aided Wireless Communications: An Element-Wise Position Optimization Approach

Mar 19, 2025

Jingjing Zhao, Qingyi Huang, Kaiquan Cai, Quan Zhou, Xidong Mu, Yuanwei Liu

Abstract: A point-to-point movable element (ME) enabled reconfigurable intelligent surface (ME-RIS) communication system is investigated, where each element position can be flexibly adjusted to create favorable channel conditions. To maximize the communication rate, an efficient ME position optimization approach is proposed. Specifically, by characterizing the cascaded channel power gain in an element-wise manner, the position of each ME is iteratively updated by invoking the successive convex approximation method. Numerical results show that 1) the proposed element-wise ME position optimization algorithm outperforms the gradient descent algorithm; and 2) the ME-RIS significantly improves the communication rate compared to the conventional RIS with fixed-position elements.

CaseGen: A Benchmark for Multi-Stage Legal Case Documents Generation

Feb 25, 2025

Haitao Li, Jiaying Ye, Yiran Hu, Jia Chen, Qingyao Ai, Yueyue Wu, Junjie Chen, Yifan Chen, Cheng Luo, Quan Zhou (+1 more)

Abstract: Legal case documents play a critical role in judicial proceedings. As the number of cases continues to rise, the reliance on manual drafting of legal case documents is facing increasing pressure and challenges. The development of large language models (LLMs) offers a promising solution for automating document generation. However, existing benchmarks fail to fully capture the complexities involved in drafting legal case documents in real-world scenarios. To address this gap, we introduce CaseGen, a benchmark for multi-stage legal case document generation in the Chinese legal domain. CaseGen is based on 500 real case samples annotated by legal experts and covers seven essential case sections. It supports four key tasks: drafting defense statements, writing trial facts, composing legal reasoning, and generating judgment results. To the best of our knowledge, CaseGen is the first benchmark designed to evaluate LLMs in the context of legal case document generation. To ensure an accurate and comprehensive evaluation, we design an LLM-as-a-judge evaluation framework and validate its effectiveness through human annotations. We evaluate several widely used general-domain LLMs and legal-specific LLMs, highlighting their limitations in case document generation and pinpointing areas for potential improvement. This work marks a step toward a more effective framework for automating legal case document drafting, paving the way for the reliable application of AI in the legal field. The dataset and code are publicly available at https://github.com/CSHaitao/CaseGen.

* 18 pages

YUNet: Improved YOLOv11 Network for Skyline Detection

Feb 18, 2025

Gang Yang, Miao Wang, Quan Zhou, Jiangchuan Li

Abstract: Skyline detection plays an important role in geolocalization, flight control, visual navigation, port security, etc. The appearance of sky and non-sky areas varies with weather and illumination, which makes skyline detection challenging. In this research, we propose the YUNet algorithm, which improves the YOLOv11 architecture to segment the sky region and extract the skyline in complicated and variable circumstances. To improve multi-scale and large-range contextual feature fusion, the YOLOv11 architecture is extended into a UNet-like architecture consisting of encoder, neck and decoder submodules. The encoder extracts multi-scale features from the given images. The neck fuses these multi-scale features. The decoder applies the fused features to reconstruct the prediction. To validate the proposed approach, YUNet was tested on the Skyfinder and CH1 datasets for segmentation and skyline detection, respectively. Our tests show that the IoU of YUNet segmentation can reach 0.9858, and the average error of YUNet skyline detection is just 1.36 pixels. The implementation is published at https://github.com/kuazhangxiaoai/SkylineDet-YOLOv11Seg.git.
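The two metrics quoted in this abstract can be made concrete with a small Python sketch. `mask_iou` is the standard intersection over union of two binary masks. `mean_skyline_error` is only one plausible reading of "average error in pixels" (mean absolute difference of per-column skyline rows); the abstract does not define it, so that definition, the convention that sky pixels are 1, and all function names are assumptions.

```python
def mask_iou(pred, truth):
    """Intersection over union of two equally sized binary masks (nested lists of 0/1)."""
    inter = union = 0
    for p_row, t_row in zip(pred, truth):
        for p, t in zip(p_row, t_row):
            inter += 1 if (p and t) else 0
            union += 1 if (p or t) else 0
    # Convention: two empty masks are treated as a perfect match.
    return inter / union if union else 1.0

def skyline_rows(mask):
    """Per column, the row index of the first non-sky pixel (value 0),
    or the image height if the whole column is sky."""
    height, width = len(mask), len(mask[0])
    return [next((r for r in range(height) if not mask[r][c]), height)
            for c in range(width)]

def mean_skyline_error(pred, truth):
    """Mean absolute difference, in pixels, between per-column skyline rows."""
    a, b = skyline_rows(pred), skyline_rows(truth)
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)

# Tiny 3x2 example: sky (1) above ground (0).
pred = [[1, 1], [1, 0], [0, 0]]
truth = [[1, 1], [0, 0], [0, 0]]
iou = mask_iou(pred, truth)
err = mean_skyline_error(pred, truth)
```

In this toy example the predicted mask has one extra sky pixel, so the two masks share two of three sky pixels and the skyline is off by one row in one of the two columns.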

Exploiting Movable-Element STARS for Wireless Communications

Dec 28, 2024

Jingjing Zhao, Quan Zhou, Xidong Mu, Kaiquan Cai, Yanbo Zhu, Yuanwei Liu

Abstract: A novel movable-element enabled simultaneously transmitting and reflecting surface (ME-STARS) communication system is proposed, where the positions of the ME-STARS elements can be adjusted to enhance the degrees of freedom for transmission and reflection. For each of the ME-STARS operating protocols, namely energy splitting (ES), mode switching (MS), and time switching (TS), a weighted sum rate (WSR) maximization problem is formulated to jointly optimize the active beamforming at the base station (BS) as well as the element positions and passive beamforming at the ME-STARS. An alternating optimization (AO)-based iterative algorithm is developed to decompose the original non-convex problem into three subproblems. Specifically, the gradient descent algorithm is employed for solving the ME-STARS element position optimization subproblem, and the weighted minimum mean square error and the successive convex approximation methods are invoked for solving the active and passive beamforming subproblems, respectively. It is further demonstrated that the proposed AO algorithm for ES can be extended to solve the problems for MS and TS. Numerical results show that: 1) the ME-STARS can significantly improve the WSR compared to the STARS with fixed-position elements and the conventional reconfigurable intelligent surface with movable elements, thanks to the extra spatial-domain diversity and the higher flexibility in beamforming; and 2) the performance gain of ME-STARS is significant in scenarios with a larger number of users or more scatterers.

Remote Manipulation of Multiple Objects with Airflow Field Using Model-Based Learning Control

Dec 04, 2024

Artur Kopitca, Shahriar Haeri, Quan Zhou

Abstract: Non-contact manipulation is an emerging and highly promising methodology in robotics, offering a wide range of scientific and industrial applications. Among the proposed approaches, airflow stands out for its ability to project across considerable distances and its flexibility in actuating objects of varying materials, sizes, and shapes. However, predicting airflow fields at a distance, as well as the motion of objects within them, remains notoriously challenging due to their nonlinear and stochastic nature. Here, we propose a model-based learning approach using a jet-induced airflow field for remote multi-object manipulation on a surface. Our approach incorporates an analytical model of the field, learned object dynamics, and a model-based controller. The model predicts an air velocity field over an infinite surface for a specified jet orientation, while the object dynamics are learned through a robust system identification algorithm. Using the model-based controller, we can automatically and remotely, at meter-scale distances, control the motion of single and multiple objects for different tasks, such as path-following, aggregating, and sorting.

* 8 pages, 7 figures
