Ahmed Touati

Simple Ingredients for Offline Reinforcement Learning

Mar 19, 2024

Score Models for Offline Goal-Conditioned Reinforcement Learning

Nov 03, 2023

A State Representation for Diminishing Rewards

Sep 07, 2023

Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees

Oct 24, 2022

Does Zero-Shot Reinforcement Learning Exist?

Sep 29, 2022

Learning One Representation to Optimize All Rewards

Mar 14, 2021

Efficient Learning in Non-Stationary Linear Markov Decision Processes

Oct 24, 2020

Maximum Reward Formulation In Reinforcement Learning

Oct 08, 2020

Sharp Analysis of Smoothed Bellman Error Embedding

Jul 07, 2020

TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?

Jul 06, 2020