Francisco S. Melo

Implicit Repair with Reinforcement Learning in Emergent Communication

Feb 18, 2025

Distributed Value Decomposition Networks with Networked Agents

Feb 11, 2025

Networked Agents in the Dark: Team Value Learning under Partial Observability

Jan 15, 2025

NeuralThink: Algorithm Synthesis that Extrapolates in General Tasks

Feb 23, 2024

Making Friends in the Dark: Ad Hoc Teamwork Under Partial Observability

Sep 30, 2023

Multi-Bellman operator for convergence of $Q$-learning with linear function approximation

Sep 28, 2023

Interactively Teaching an Inverse Reinforcement Learner with Limited Feedback

Sep 16, 2023

Learning to Perceive in Deep Model-Free Reinforcement Learning

Jan 13, 2023

Centralized Training with Hybrid Execution in Multi-Agent Reinforcement Learning

Oct 12, 2022

"Guess what I'm doing": Extending legibility to sequential decision tasks

Sep 19, 2022