Linjiajie Fang

Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model

Oct 27, 2024
Via arXiv

Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning

May 31, 2024
Via arXiv

Enhanced Bayesian Personalized Ranking for Robust Hard Negative Sampling in Recommender Systems

Mar 28, 2024
Via arXiv