Picture for Guojian Wang

Guojian Wang

VIPO: Value Function Inconsistency Penalized Offline Reinforcement Learning

Add code
Apr 16, 2025
Viaarxiv icon

Preference-Guided Reinforcement Learning for Efficient Exploration

Add code
Jul 09, 2024
Figure 1 for Preference-Guided Reinforcement Learning for Efficient Exploration
Figure 2 for Preference-Guided Reinforcement Learning for Efficient Exploration
Figure 3 for Preference-Guided Reinforcement Learning for Efficient Exploration
Figure 4 for Preference-Guided Reinforcement Learning for Efficient Exploration
Viaarxiv icon

Learning Diverse Policies with Soft Self-Generated Guidance

Add code
Feb 07, 2024
Figure 1 for Learning Diverse Policies with Soft Self-Generated Guidance
Figure 2 for Learning Diverse Policies with Soft Self-Generated Guidance
Figure 3 for Learning Diverse Policies with Soft Self-Generated Guidance
Figure 4 for Learning Diverse Policies with Soft Self-Generated Guidance
Viaarxiv icon

Trajectory-Oriented Policy Optimization with Sparse Rewards

Add code
Jan 04, 2024
Figure 1 for Trajectory-Oriented Policy Optimization with Sparse Rewards
Figure 2 for Trajectory-Oriented Policy Optimization with Sparse Rewards
Figure 3 for Trajectory-Oriented Policy Optimization with Sparse Rewards
Figure 4 for Trajectory-Oriented Policy Optimization with Sparse Rewards
Viaarxiv icon

Policy Optimization with Smooth Guidance Rewards Learned from Sparse-Reward Demonstrations

Add code
Dec 30, 2023
Figure 1 for Policy Optimization with Smooth Guidance Rewards Learned from Sparse-Reward Demonstrations
Figure 2 for Policy Optimization with Smooth Guidance Rewards Learned from Sparse-Reward Demonstrations
Figure 3 for Policy Optimization with Smooth Guidance Rewards Learned from Sparse-Reward Demonstrations
Figure 4 for Policy Optimization with Smooth Guidance Rewards Learned from Sparse-Reward Demonstrations
Viaarxiv icon

Adaptive trajectory-constrained exploration strategy for deep reinforcement learning

Add code
Dec 27, 2023
Figure 1 for Adaptive trajectory-constrained exploration strategy for deep reinforcement learning
Figure 2 for Adaptive trajectory-constrained exploration strategy for deep reinforcement learning
Figure 3 for Adaptive trajectory-constrained exploration strategy for deep reinforcement learning
Figure 4 for Adaptive trajectory-constrained exploration strategy for deep reinforcement learning
Viaarxiv icon