Takuya Hiraoka

Which Experiences Are Influential for RL Agents? Efficiently Estimating The Influence of Experiences

May 23, 2024

Efficient Sparse-Reward Goal-Conditioned Reinforcement Learning with a High Replay Ratio and Regularization

Dec 10, 2023

Unsupervised Discovery of Continuous Skills on a Sphere

May 25, 2023

Which Experiences Are Influential for Your Agent? Policy Iteration with Turn-over Dropout

Jan 26, 2023

Dropout Q-Functions for Doubly Efficient Reinforcement Learning

Oct 05, 2021

Off-Policy Meta-Reinforcement Learning Based on Feature Embedding Spaces

Jan 06, 2021

Meta-Model-Based Meta-Policy Optimization

Jun 05, 2020

Optimistic Proximal Policy Optimization

Jun 25, 2019

Learning Robust Options by Conditional Value at Risk Optimization

Jun 11, 2019

Optimization of Information-Seeking Dialogue Strategy for Argumentation-Based Dialogue System

Nov 26, 2018