Picture for Trevor McInroe

Trevor McInroe

Efficient Offline Reinforcement Learning: The Critic is Critical

Add code
Jun 19, 2024
Viaarxiv icon

LLM-Personalize: Aligning LLM Planners with Human Preferences via Reinforced Self-Training for Housekeeping Robots

Add code
Apr 22, 2024
Viaarxiv icon

Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement Learning

Add code
Oct 09, 2023
Viaarxiv icon

Conditional Mutual Information for Disentangled Representations in Reinforcement Learning

Add code
May 23, 2023
Viaarxiv icon

Deep Reinforcement Learning for Multi-Agent Interaction

Add code
Aug 02, 2022
Viaarxiv icon

Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning

Add code
Jul 12, 2022
Figure 1 for Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning
Figure 2 for Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning
Figure 3 for Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning
Figure 4 for Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning
Viaarxiv icon

Learning Representations for Control with Hierarchical Forward Models

Add code
Jun 22, 2022
Figure 1 for Learning Representations for Control with Hierarchical Forward Models
Figure 2 for Learning Representations for Control with Hierarchical Forward Models
Figure 3 for Learning Representations for Control with Hierarchical Forward Models
Figure 4 for Learning Representations for Control with Hierarchical Forward Models
Viaarxiv icon

Learning Temporally-Consistent Representations for Data-Efficient Reinforcement Learning

Add code
Oct 11, 2021
Figure 1 for Learning Temporally-Consistent Representations for Data-Efficient Reinforcement Learning
Figure 2 for Learning Temporally-Consistent Representations for Data-Efficient Reinforcement Learning
Figure 3 for Learning Temporally-Consistent Representations for Data-Efficient Reinforcement Learning
Figure 4 for Learning Temporally-Consistent Representations for Data-Efficient Reinforcement Learning
Viaarxiv icon