Qinghua Liu

Learning to Achieve Goals with Belief State Transformers

Oct 30, 2024

On Limitation of Transformer for Learning HMMs

Jun 06, 2024

PIAVE: A Pose-Invariant Audio-Visual Speaker Extraction Network

Sep 13, 2023

Is RLHF More Difficult than Standard RL?

Jun 25, 2023

Context-lumpable stochastic bandits

Jun 22, 2023

Optimistic Natural Policy Gradient: a Simple Efficient Policy Optimization Framework for Online RL

May 18, 2023

Breaking the Curse of Multiagency: Provably Efficient Decentralized Multi-Agent RL with Function Approximation

Mar 02, 2023

Optimistic MLE -- A Generic Model-based Algorithm for Partially Observable Sequential Decision Making

Sep 29, 2022

Dive into Big Model Training

Jul 25, 2022

A Deep Reinforcement Learning Approach for Finding Non-Exploitable Strategies in Two-Player Atari Games

Jul 18, 2022