Trevor McInroe

Efficient Offline Reinforcement Learning: The Critic is Critical

Jun 19, 2024

LLM-Personalize: Aligning LLM Planners with Human Preferences via Reinforced Self-Training for Housekeeping Robots

Apr 22, 2024

Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement Learning

Oct 09, 2023

Conditional Mutual Information for Disentangled Representations in Reinforcement Learning

May 23, 2023

Deep Reinforcement Learning for Multi-Agent Interaction

Aug 02, 2022

Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning

Jul 12, 2022

Learning Representations for Control with Hierarchical Forward Models

Jun 22, 2022

Learning Temporally-Consistent Representations for Data-Efficient Reinforcement Learning

Oct 11, 2021