Picture for Peihong Yu

Peihong Yu

VARP: Reinforcement Learning from Vision-Language Model Feedback with Agent Regularized Preferences

Add code
Mar 18, 2025
Viaarxiv icon

Sketch-to-Skill: Bootstrapping Robot Learning with Human Drawn Trajectory Sketches

Add code
Mar 14, 2025
Viaarxiv icon

On the Sample Complexity of a Policy Gradient Algorithm with Occupancy Approximation for General Utility Reinforcement Learning

Add code
Oct 05, 2024
Figure 1 for On the Sample Complexity of a Policy Gradient Algorithm with Occupancy Approximation for General Utility Reinforcement Learning
Figure 2 for On the Sample Complexity of a Policy Gradient Algorithm with Occupancy Approximation for General Utility Reinforcement Learning
Figure 3 for On the Sample Complexity of a Policy Gradient Algorithm with Occupancy Approximation for General Utility Reinforcement Learning
Figure 4 for On the Sample Complexity of a Policy Gradient Algorithm with Occupancy Approximation for General Utility Reinforcement Learning
Viaarxiv icon

Beyond Joint Demonstrations: Personalized Expert Guidance for Efficient Multi-Agent Reinforcement Learning

Add code
Mar 13, 2024
Figure 1 for Beyond Joint Demonstrations: Personalized Expert Guidance for Efficient Multi-Agent Reinforcement Learning
Figure 2 for Beyond Joint Demonstrations: Personalized Expert Guidance for Efficient Multi-Agent Reinforcement Learning
Figure 3 for Beyond Joint Demonstrations: Personalized Expert Guidance for Efficient Multi-Agent Reinforcement Learning
Figure 4 for Beyond Joint Demonstrations: Personalized Expert Guidance for Efficient Multi-Agent Reinforcement Learning
Viaarxiv icon

Enhancing Multi-Agent Coordination through Common Operating Picture Integration

Add code
Nov 08, 2023
Figure 1 for Enhancing Multi-Agent Coordination through Common Operating Picture Integration
Figure 2 for Enhancing Multi-Agent Coordination through Common Operating Picture Integration
Figure 3 for Enhancing Multi-Agent Coordination through Common Operating Picture Integration
Figure 4 for Enhancing Multi-Agent Coordination through Common Operating Picture Integration
Viaarxiv icon

Insta-RS: Instance-wise Randomized Smoothing for Improved Robustness and Accuracy

Add code
Mar 21, 2021
Figure 1 for Insta-RS: Instance-wise Randomized Smoothing for Improved Robustness and Accuracy
Figure 2 for Insta-RS: Instance-wise Randomized Smoothing for Improved Robustness and Accuracy
Figure 3 for Insta-RS: Instance-wise Randomized Smoothing for Improved Robustness and Accuracy
Figure 4 for Insta-RS: Instance-wise Randomized Smoothing for Improved Robustness and Accuracy
Viaarxiv icon