Picture for Simone Parisi

Simone Parisi

Beyond Optimism: Exploration With Partially Observable Rewards

Add code
Jun 20, 2024
Viaarxiv icon

Monitored Markov Decision Processes

Add code
Feb 13, 2024
Figure 1 for Monitored Markov Decision Processes
Figure 2 for Monitored Markov Decision Processes
Figure 3 for Monitored Markov Decision Processes
Figure 4 for Monitored Markov Decision Processes
Viaarxiv icon

The Unsurprising Effectiveness of Pre-Trained Vision Models for Control

Add code
Mar 07, 2022
Figure 1 for The Unsurprising Effectiveness of Pre-Trained Vision Models for Control
Figure 2 for The Unsurprising Effectiveness of Pre-Trained Vision Models for Control
Figure 3 for The Unsurprising Effectiveness of Pre-Trained Vision Models for Control
Figure 4 for The Unsurprising Effectiveness of Pre-Trained Vision Models for Control
Viaarxiv icon

Interesting Object, Curious Agent: Learning Task-Agnostic Exploration

Add code
Nov 25, 2021
Figure 1 for Interesting Object, Curious Agent: Learning Task-Agnostic Exploration
Figure 2 for Interesting Object, Curious Agent: Learning Task-Agnostic Exploration
Figure 3 for Interesting Object, Curious Agent: Learning Task-Agnostic Exploration
Figure 4 for Interesting Object, Curious Agent: Learning Task-Agnostic Exploration
Viaarxiv icon

Long-Term Visitation Value for Deep Exploration in Sparse Reward Reinforcement Learning

Add code
Jan 01, 2020
Figure 1 for Long-Term Visitation Value for Deep Exploration in Sparse Reward Reinforcement Learning
Figure 2 for Long-Term Visitation Value for Deep Exploration in Sparse Reward Reinforcement Learning
Figure 3 for Long-Term Visitation Value for Deep Exploration in Sparse Reward Reinforcement Learning
Figure 4 for Long-Term Visitation Value for Deep Exploration in Sparse Reward Reinforcement Learning
Viaarxiv icon

TD-Regularized Actor-Critic Methods

Add code
Dec 23, 2018
Figure 1 for TD-Regularized Actor-Critic Methods
Figure 2 for TD-Regularized Actor-Critic Methods
Figure 3 for TD-Regularized Actor-Critic Methods
Figure 4 for TD-Regularized Actor-Critic Methods
Viaarxiv icon

Policy Search with High-Dimensional Context Variables

Add code
Nov 10, 2016
Figure 1 for Policy Search with High-Dimensional Context Variables
Figure 2 for Policy Search with High-Dimensional Context Variables
Figure 3 for Policy Search with High-Dimensional Context Variables
Figure 4 for Policy Search with High-Dimensional Context Variables
Viaarxiv icon

Multi-objective Reinforcement Learning with Continuous Pareto Frontier Approximation Supplementary Material

Add code
Nov 18, 2014
Figure 1 for Multi-objective Reinforcement Learning with Continuous Pareto Frontier Approximation Supplementary Material
Figure 2 for Multi-objective Reinforcement Learning with Continuous Pareto Frontier Approximation Supplementary Material
Viaarxiv icon