Qin Lu

Scalable Variational Bayesian Fine-Tuning of LLMs via Orthogonalized Low-Rank Adapters

Apr 03, 2026

Diffusion Policy with Bayesian Expert Selection for Active Multi-Target Tracking

Apr 03, 2026

Approximation of Log-Partition Function in Policy Mirror Descent Induces Implicit Regularization for LLM Post-Training

Feb 05, 2026

Ask a Strong LLM Judge when Your Reward Model is Uncertain

Oct 23, 2025

Fine-tuning LLMs with variational Bayesian last layer for high-dimensional Bayesian optimization

Oct 01, 2025

Deploying AI for Signal Processing education: Selected challenges and intriguing opportunities

Sep 10, 2025

WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning

May 22, 2025

Think-RM: Enabling Long-Horizon Reasoning in Generative Reward Models

May 22, 2025

Bayesian Optimization for Robust Identification of Ornstein-Uhlenbeck Model

Mar 09, 2025

Online scalable Gaussian processes with conformal prediction for guaranteed coverage

Oct 07, 2024