Qinqing Zheng

Online Intrinsic Rewards for Decision Making Agents from Large Language Model Feedback

Oct 30, 2024

Dualformer: Controllable Fast and Slow Thinking by Learning with Randomized Reasoning Traces

Oct 13, 2024

Diffusion World Model

Feb 11, 2024

Guided Flows for Generative Modeling and Decision Making

Dec 07, 2023

Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories

Oct 12, 2022

ConserWeightive Behavioral Cloning for Reliable Offline Reinforcement Learning

Oct 11, 2022

Latent State Marginalization as a Low-cost Approach for Improving Exploration

Oct 03, 2022

Online Decision Transformer

Feb 11, 2022

A Theorem of the Alternative for Personalized Federated Learning

Mar 02, 2021

Federated $f$-Differential Privacy

Feb 22, 2021