Eiji Uchibe

Evaluation of Best-of-N Sampling Strategies for Language Model Alignment

Feb 18, 2025
Via arXiv

Theoretical Guarantees for Minimum Bayes Risk Decoding

Feb 18, 2025
Via arXiv

Unsupervised Neural Motion Retargeting for Humanoid Teleoperation

Jun 02, 2024
Via arXiv

Reward-Punishment Reinforcement Learning with Maximum Entropy

May 20, 2024
Via arXiv

Randomized-to-Canonical Model Predictive Control for Real-world Visual Robotic Manipulation

Jul 05, 2022
Via arXiv

Model-Based Imitation Learning Using Entropy Regularization of Model and Policy

Jun 21, 2022
Via arXiv

$q$-Munchausen Reinforcement Learning

May 16, 2022
Via arXiv

Enforcing KL Regularization in General Tsallis Entropy Reinforcement Learning via Advantage Learning

May 16, 2022
Via arXiv

Imitation learning based on entropy-regularized forward and inverse reinforcement learning

Aug 17, 2020
Via arXiv

Unbounded Output Networks for Classification

Jul 25, 2018
Via arXiv