Picture for Ruiquan Huang

Ruiquan Huang

Augmenting Online RL with Offline Data is All You Need: A Unified Hybrid RL Algorithm Design and Analysis

Add code
May 19, 2025
Viaarxiv icon

How Transformers Learn Regular Language Recognition: A Theoretical Study on Training Dynamics and Implicit Bias

Add code
May 02, 2025
Viaarxiv icon

Robust Offline Reinforcement Learning for Non-Markovian Decision Processes

Add code
Nov 12, 2024
Viaarxiv icon

Federated Online Prediction from Experts with Differential Privacy: Separations and Regret Speed-ups

Add code
Sep 27, 2024
Viaarxiv icon

Non-asymptotic Convergence of Training Transformers for Next-token Prediction

Add code
Sep 25, 2024
Figure 1 for Non-asymptotic Convergence of Training Transformers for Next-token Prediction
Viaarxiv icon

Provable Benefits of Multi-task RL under Non-Markovian Decision Making Processes

Add code
Oct 20, 2023
Viaarxiv icon

Temporal-Distributed Backdoor Attack Against Video Based Action Recognition

Add code
Sep 01, 2023
Figure 1 for Temporal-Distributed Backdoor Attack Against Video Based Action Recognition
Figure 2 for Temporal-Distributed Backdoor Attack Against Video Based Action Recognition
Figure 3 for Temporal-Distributed Backdoor Attack Against Video Based Action Recognition
Figure 4 for Temporal-Distributed Backdoor Attack Against Video Based Action Recognition
Viaarxiv icon

Provably Efficient UCB-type Algorithms For Learning Predictive State Representations

Add code
Jul 01, 2023
Viaarxiv icon

Differentially Private Wireless Federated Learning Using Orthogonal Sequences

Add code
Jun 14, 2023
Viaarxiv icon

Near-optimal Conservative Exploration in Reinforcement Learning under Episode-wise Constraints

Add code
Jun 09, 2023
Viaarxiv icon