Alberto Sardinha

INESC-ID Lisboa, Instituto Superior Técnico

Learning to Perceive in Deep Model-Free Reinforcement Learning

Jan 13, 2023

Centralized Training with Hybrid Execution in Multi-Agent Reinforcement Learning

Oct 12, 2022

Perceive, Represent, Generate: Translating Multimodal Information to Robotic Motion Trajectories

Apr 06, 2022

Onception: Active Learning with Expert Advice for Real World Machine Translation

Mar 12, 2022

Assisting Unknown Teammates in Unknown Tasks: Ad Hoc Teamwork under Partial Observability

Jan 10, 2022

Understanding the Impact of Data Distribution on Q-learning with Function Approximation

Nov 23, 2021

Online Learning Meets Machine Translation Evaluation: Finding the Best Systems with the Least Human Effort

May 27, 2021

A Methodology for the Development of RL-Based Adaptive Traffic Signal Controllers

Jan 24, 2021