Picture for Nuoya Xiong

Nuoya Xiong

A Correction of Pseudo Log-Likelihood Method

Add code
Mar 26, 2024
Viaarxiv icon

Sample-Efficient Multi-Agent RL: An Optimization Perspective

Add code
Oct 10, 2023
Viaarxiv icon

How Over-Parameterization Slows Down Gradient Descent in Matrix Sensing: The Curses of Symmetry and Initialization

Add code
Oct 09, 2023
Viaarxiv icon

A General Framework for Sequential Decision-Making under Adaptivity Constraints

Add code
Jun 27, 2023
Viaarxiv icon

Provably Safe Reinforcement Learning with Step-wise Violation Constraints

Add code
Feb 13, 2023
Viaarxiv icon

Combinatorial Causal Bandits without Graph Skeleton

Add code
Jan 31, 2023
Viaarxiv icon

Pure Exploration of Causal Bandits

Add code
Jun 16, 2022
Figure 1 for Pure Exploration of Causal Bandits
Figure 2 for Pure Exploration of Causal Bandits
Viaarxiv icon