Picture for Tadashi Kozuno

Tadashi Kozuno

Symmetry-Breaking in Multi-Agent Navigation: Winding Number-Aware MPC with a Learned Topological Strategy

Add code
Nov 19, 2025
Viaarxiv icon

Self Iterative Label Refinement via Robust Unlabeled Learning

Add code
Feb 18, 2025
Viaarxiv icon

Near-Optimal Policy Identification in Robust Constrained Markov Decision Processes via Epigraph Form

Add code
Sep 02, 2024
Viaarxiv icon

Symmetry-aware Reinforcement Learning for Robotic Assembly under Partial Observability with a Soft Wrist

Add code
Feb 28, 2024
Figure 1 for Symmetry-aware Reinforcement Learning for Robotic Assembly under Partial Observability with a Soft Wrist
Figure 2 for Symmetry-aware Reinforcement Learning for Robotic Assembly under Partial Observability with a Soft Wrist
Figure 3 for Symmetry-aware Reinforcement Learning for Robotic Assembly under Partial Observability with a Soft Wrist
Figure 4 for Symmetry-aware Reinforcement Learning for Robotic Assembly under Partial Observability with a Soft Wrist
Viaarxiv icon

A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC Guarantees

Add code
Feb 02, 2024
Figure 1 for A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC Guarantees
Figure 2 for A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC Guarantees
Viaarxiv icon

Multi-Agent Behavior Retrieval

Add code
Dec 04, 2023
Viaarxiv icon

Local and adaptive mirror descents in extensive-form games

Add code
Sep 01, 2023
Figure 1 for Local and adaptive mirror descents in extensive-form games
Viaarxiv icon

DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm

Add code
May 29, 2023
Figure 1 for DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm
Figure 2 for DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm
Figure 3 for DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm
Figure 4 for DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm
Viaarxiv icon

Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice

Add code
May 22, 2023
Viaarxiv icon

Counterfactual Fairness Filter for Fair-Delay Multi-Robot Navigation

Add code
May 19, 2023
Figure 1 for Counterfactual Fairness Filter for Fair-Delay Multi-Robot Navigation
Figure 2 for Counterfactual Fairness Filter for Fair-Delay Multi-Robot Navigation
Figure 3 for Counterfactual Fairness Filter for Fair-Delay Multi-Robot Navigation
Figure 4 for Counterfactual Fairness Filter for Fair-Delay Multi-Robot Navigation
Viaarxiv icon