Picture for Ryo Iwaki

Ryo Iwaki

Distorted Distributional Policy Evaluation for Offline Reinforcement Learning

Add code
Jan 05, 2026
Viaarxiv icon

Mirror Descent Actor Critic via Bounded Advantage Learning

Add code
Feb 06, 2025
Figure 1 for Mirror Descent Actor Critic via Bounded Advantage Learning
Figure 2 for Mirror Descent Actor Critic via Bounded Advantage Learning
Figure 3 for Mirror Descent Actor Critic via Bounded Advantage Learning
Figure 4 for Mirror Descent Actor Critic via Bounded Advantage Learning
Viaarxiv icon

Situated GAIL: Multitask imitation using task-conditioned adversarial inverse reinforcement learning

Add code
Nov 01, 2019
Figure 1 for Situated GAIL: Multitask imitation using task-conditioned adversarial inverse reinforcement learning
Figure 2 for Situated GAIL: Multitask imitation using task-conditioned adversarial inverse reinforcement learning
Figure 3 for Situated GAIL: Multitask imitation using task-conditioned adversarial inverse reinforcement learning
Figure 4 for Situated GAIL: Multitask imitation using task-conditioned adversarial inverse reinforcement learning
Viaarxiv icon

On- and Off-Policy Monotonic Policy Improvement

Add code
Nov 01, 2017
Figure 1 for On- and Off-Policy Monotonic Policy Improvement
Viaarxiv icon