Picture for Tadashi Kozuno

Tadashi Kozuno

Near-Optimal Policy Identification in Robust Constrained Markov Decision Processes via Epigraph Form

Add code
Sep 02, 2024
Viaarxiv icon

Symmetry-aware Reinforcement Learning for Robotic Assembly under Partial Observability with a Soft Wrist

Add code
Feb 28, 2024
Viaarxiv icon

A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC Guarantees

Add code
Feb 02, 2024
Viaarxiv icon

Multi-Agent Behavior Retrieval

Add code
Dec 04, 2023
Viaarxiv icon

Local and adaptive mirror descents in extensive-form games

Add code
Sep 01, 2023
Viaarxiv icon

DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm

Add code
May 29, 2023
Figure 1 for DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm
Figure 2 for DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm
Figure 3 for DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm
Figure 4 for DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm
Viaarxiv icon

Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice

Add code
May 22, 2023
Viaarxiv icon

Counterfactual Fairness Filter for Fair-Delay Multi-Robot Navigation

Add code
May 19, 2023
Viaarxiv icon

When to Replan? An Adaptive Replanning Strategy for Autonomous Navigation using Deep Reinforcement Learning

Add code
Apr 24, 2023
Viaarxiv icon

Benchmarking Actor-Critic Deep Reinforcement Learning Algorithms for Robotics Control with Action Constraints

Add code
Apr 18, 2023
Viaarxiv icon