Picture for Taisuke Kobayashi

Taisuke Kobayashi

DROP: Distributional and Regular Optimism and Pessimism for Reinforcement Learning

Add code
Oct 22, 2024
Viaarxiv icon

Domains as Objectives: Domain-Uncertainty-Aware Policy Optimization through Explicit Multi-Domain Convex Coverage Set Learning

Add code
Oct 07, 2024
Viaarxiv icon

LiRA: Light-Robust Adversary for Model-based Reinforcement Learning in Real World

Add code
Sep 29, 2024
Viaarxiv icon

Revisiting Experience Replayable Conditions

Add code
Feb 15, 2024
Viaarxiv icon

Intentionally-underestimated Value Function at Terminal State for Temporal-difference Learning with Mis-designed Reward

Add code
Aug 24, 2023
Viaarxiv icon

Soft Actor-Critic Algorithm with Truly Inequality Constraint

Add code
Mar 08, 2023
Viaarxiv icon

Reward Bonuses with Gain Scheduling Inspired by Iterative Deepening Search

Add code
Dec 21, 2022
Viaarxiv icon

Real-time Sampling-based Model Predictive Control based on Reverse Kullback-Leibler Divergence and Its Adaptive Acceleration

Add code
Dec 08, 2022
Viaarxiv icon

Sparse Representation Learning with Modified q-VAE towards Minimal Realization of World Model

Add code
Aug 08, 2022
Viaarxiv icon

Proximal Policy Optimization with Adaptive Threshold for Symmetric Relative Density Ratio

Add code
Mar 18, 2022
Figure 1 for Proximal Policy Optimization with Adaptive Threshold for Symmetric Relative Density Ratio
Figure 2 for Proximal Policy Optimization with Adaptive Threshold for Symmetric Relative Density Ratio
Figure 3 for Proximal Policy Optimization with Adaptive Threshold for Symmetric Relative Density Ratio
Figure 4 for Proximal Policy Optimization with Adaptive Threshold for Symmetric Relative Density Ratio
Viaarxiv icon