Picture for Sam Lobel

Sam Lobel

Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy

Add code
Jul 10, 2024
Viaarxiv icon

An Optimal Tightness Bound for the Simulation Lemma

Add code
Jun 24, 2024
Viaarxiv icon

Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning

Add code
Jun 05, 2023
Viaarxiv icon

Coarse-Grained Smoothness for RL in Metric Spaces

Add code
Oct 23, 2021
Figure 1 for Coarse-Grained Smoothness for RL in Metric Spaces
Figure 2 for Coarse-Grained Smoothness for RL in Metric Spaces
Figure 3 for Coarse-Grained Smoothness for RL in Metric Spaces
Figure 4 for Coarse-Grained Smoothness for RL in Metric Spaces
Viaarxiv icon

Towards Amortized Ranking-Critical Training for Collaborative Filtering

Add code
Jun 10, 2019
Figure 1 for Towards Amortized Ranking-Critical Training for Collaborative Filtering
Figure 2 for Towards Amortized Ranking-Critical Training for Collaborative Filtering
Figure 3 for Towards Amortized Ranking-Critical Training for Collaborative Filtering
Figure 4 for Towards Amortized Ranking-Critical Training for Collaborative Filtering
Viaarxiv icon