Picture for Xiao-Yue Gong

Xiao-Yue Gong

Asymptotically Optimal Pure Exploration for Infinite-Armed Bandits

Add code
Jun 03, 2023
Viaarxiv icon

Provably More Efficient Q-Learning in the Full-Feedback/One-Sided-Feedback Settings

Add code
Jun 30, 2020
Figure 1 for Provably More Efficient Q-Learning in the Full-Feedback/One-Sided-Feedback Settings
Figure 2 for Provably More Efficient Q-Learning in the Full-Feedback/One-Sided-Feedback Settings
Figure 3 for Provably More Efficient Q-Learning in the Full-Feedback/One-Sided-Feedback Settings
Figure 4 for Provably More Efficient Q-Learning in the Full-Feedback/One-Sided-Feedback Settings
Viaarxiv icon

Efficient Entropy for Policy Gradient with Multidimensional Action Space

Add code
Jun 02, 2018
Figure 1 for Efficient Entropy for Policy Gradient with Multidimensional Action Space
Figure 2 for Efficient Entropy for Policy Gradient with Multidimensional Action Space
Figure 3 for Efficient Entropy for Policy Gradient with Multidimensional Action Space
Figure 4 for Efficient Entropy for Policy Gradient with Multidimensional Action Space
Viaarxiv icon