Picture for Miguel Suau

Miguel Suau

Bad Habits: Policy Confounding and Out-of-Trajectory Generalization in RL

Add code
Jun 04, 2023
Viaarxiv icon

Distributed Influence-Augmented Local Simulators for Parallel MARL in Large Networked Systems

Add code
Jul 01, 2022
Figure 1 for Distributed Influence-Augmented Local Simulators for Parallel MARL in Large Networked Systems
Figure 2 for Distributed Influence-Augmented Local Simulators for Parallel MARL in Large Networked Systems
Figure 3 for Distributed Influence-Augmented Local Simulators for Parallel MARL in Large Networked Systems
Figure 4 for Distributed Influence-Augmented Local Simulators for Parallel MARL in Large Networked Systems
Viaarxiv icon

Influence-Augmented Local Simulators: A Scalable Solution for Fast Deep RL in Large Networked Systems

Add code
Feb 03, 2022
Figure 1 for Influence-Augmented Local Simulators: A Scalable Solution for Fast Deep RL in Large Networked Systems
Figure 2 for Influence-Augmented Local Simulators: A Scalable Solution for Fast Deep RL in Large Networked Systems
Figure 3 for Influence-Augmented Local Simulators: A Scalable Solution for Fast Deep RL in Large Networked Systems
Figure 4 for Influence-Augmented Local Simulators: A Scalable Solution for Fast Deep RL in Large Networked Systems
Viaarxiv icon

Online Planning in POMDPs with Self-Improving Simulators

Add code
Jan 27, 2022
Figure 1 for Online Planning in POMDPs with Self-Improving Simulators
Figure 2 for Online Planning in POMDPs with Self-Improving Simulators
Figure 3 for Online Planning in POMDPs with Self-Improving Simulators
Figure 4 for Online Planning in POMDPs with Self-Improving Simulators
Viaarxiv icon

Offline Contextual Bandits for Wireless Network Optimization

Add code
Nov 11, 2021
Figure 1 for Offline Contextual Bandits for Wireless Network Optimization
Figure 2 for Offline Contextual Bandits for Wireless Network Optimization
Figure 3 for Offline Contextual Bandits for Wireless Network Optimization
Viaarxiv icon

Influence-Augmented Online Planning for Complex Environments

Add code
Oct 21, 2020
Figure 1 for Influence-Augmented Online Planning for Complex Environments
Figure 2 for Influence-Augmented Online Planning for Complex Environments
Figure 3 for Influence-Augmented Online Planning for Complex Environments
Figure 4 for Influence-Augmented Online Planning for Complex Environments
Viaarxiv icon

Influence-aware Memory for Deep Reinforcement Learning

Add code
Nov 21, 2019
Figure 1 for Influence-aware Memory for Deep Reinforcement Learning
Figure 2 for Influence-aware Memory for Deep Reinforcement Learning
Figure 3 for Influence-aware Memory for Deep Reinforcement Learning
Figure 4 for Influence-aware Memory for Deep Reinforcement Learning
Viaarxiv icon