Xuyang Chen

Taming OOD Actions for Offline Reinforcement Learning: An Advantage-Based Approach

May 08, 2025

Global Optimality of Single-Timescale Actor-Critic under Continuous State-Action Space: A Study on Linear Quadratic Regulator

May 02, 2025

VIPO: Value Function Inconsistency Penalized Offline Reinforcement Learning

Apr 16, 2025

The Communication and Computation Trade-off in Wireless Semantic Communications

Apr 14, 2025

Preference-Guided Reinforcement Learning for Efficient Exploration

Jul 09, 2024

Contrastive Learning of Shared Spatiotemporal EEG Representations Across Individuals for Naturalistic Neuroscience

Feb 22, 2024

Optimization Landscape of Policy Gradient Methods for Discrete-time Static Output Feedback

Oct 29, 2023

Box2Poly: Memory-Efficient Polygon Prediction of Arbitrarily Shaped and Rotated Text

Sep 20, 2023

Trust-Region Neural Moving Horizon Estimation for Robots

Sep 19, 2023

Efficient Q-Learning over Visit Frequency Maps for Multi-agent Exploration of Unknown Environments

Jul 30, 2023