Taisuke Kobayashi

Weber-Fechner Law in Temporal Difference learning derived from Control as Inference

Dec 30, 2024
Via arXiv

Design of Restricted Normalizing Flow towards Arbitrary Stochastic Policy with Computational Efficiency

Dec 17, 2024
Via arXiv

DROP: Distributional and Regular Optimism and Pessimism for Reinforcement Learning

Oct 22, 2024
Via arXiv

Domains as Objectives: Domain-Uncertainty-Aware Policy Optimization through Explicit Multi-Domain Convex Coverage Set Learning

Oct 07, 2024
Via arXiv

LiRA: Light-Robust Adversary for Model-based Reinforcement Learning in Real World

Sep 29, 2024
Via arXiv

Revisiting Experience Replayable Conditions

Feb 15, 2024
Via arXiv

Intentionally-underestimated Value Function at Terminal State for Temporal-difference Learning with Mis-designed Reward

Aug 24, 2023
Via arXiv

Soft Actor-Critic Algorithm with Truly Inequality Constraint

Mar 08, 2023
Via arXiv

Reward Bonuses with Gain Scheduling Inspired by Iterative Deepening Search

Dec 21, 2022
Via arXiv

Real-time Sampling-based Model Predictive Control based on Reverse Kullback-Leibler Divergence and Its Adaptive Acceleration

Dec 08, 2022
Via arXiv