Picture for Peter Stone

Peter Stone

UT Austin, Sony AI

Reinforcement Learning Within the Classical Robotics Stack: A Case Study in Robot Soccer

Add code
Dec 12, 2024
Viaarxiv icon

RL Zero: Zero-Shot Language to Behaviors without any Supervision

Add code
Dec 07, 2024
Viaarxiv icon

Proto Successor Measure: Representing the Space of All Possible Solutions of Reinforcement Learning

Add code
Nov 29, 2024
Viaarxiv icon

Learning Memory Mechanisms for Decision Making through Demonstrations

Add code
Nov 13, 2024
Viaarxiv icon

PACER: Preference-conditioned All-terrain Costmap Generation

Add code
Oct 30, 2024
Figure 1 for PACER: Preference-conditioned All-terrain Costmap Generation
Figure 2 for PACER: Preference-conditioned All-terrain Costmap Generation
Figure 3 for PACER: Preference-conditioned All-terrain Costmap Generation
Figure 4 for PACER: Preference-conditioned All-terrain Costmap Generation
Viaarxiv icon

SkiLD: Unsupervised Skill Discovery Guided by Factor Interactions

Add code
Oct 24, 2024
Figure 1 for SkiLD: Unsupervised Skill Discovery Guided by Factor Interactions
Figure 2 for SkiLD: Unsupervised Skill Discovery Guided by Factor Interactions
Figure 3 for SkiLD: Unsupervised Skill Discovery Guided by Factor Interactions
Figure 4 for SkiLD: Unsupervised Skill Discovery Guided by Factor Interactions
Viaarxiv icon

Learning to Look: Seeking Information for Decision Making via Policy Factorization

Add code
Oct 24, 2024
Viaarxiv icon

Disentangled Unsupervised Skill Discovery for Efficient Hierarchical Reinforcement Learning

Add code
Oct 15, 2024
Figure 1 for Disentangled Unsupervised Skill Discovery for Efficient Hierarchical Reinforcement Learning
Figure 2 for Disentangled Unsupervised Skill Discovery for Efficient Hierarchical Reinforcement Learning
Figure 3 for Disentangled Unsupervised Skill Discovery for Efficient Hierarchical Reinforcement Learning
Figure 4 for Disentangled Unsupervised Skill Discovery for Efficient Hierarchical Reinforcement Learning
Viaarxiv icon

SimBa: Simplicity Bias for Scaling Up Parameters in Deep Reinforcement Learning

Add code
Oct 13, 2024
Viaarxiv icon

Effort Allocation for Deadline-Aware Task and Motion Planning: A Metareasoning Approach

Add code
Oct 08, 2024
Viaarxiv icon