Picture for Amir-massoud Farahmand

Amir-massoud Farahmand

MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL

Add code
Oct 11, 2024
Viaarxiv icon

Deflated Dynamics Value Iteration

Add code
Jul 15, 2024
Viaarxiv icon

PID Accelerated Temporal Difference Algorithms

Add code
Jul 11, 2024
Viaarxiv icon

When does Self-Prediction help? Understanding Auxiliary Tasks in Reinforcement Learning

Add code
Jun 25, 2024
Viaarxiv icon

Dissecting Deep RL with High Update Ratios: Combatting Value Overestimation and Divergence

Add code
Mar 09, 2024
Viaarxiv icon

Improving Adversarial Transferability via Model Alignment

Add code
Nov 30, 2023
Viaarxiv icon

Maximum Entropy Model Correction in Reinforcement Learning

Add code
Nov 29, 2023
Viaarxiv icon

Understanding the robustness difference between stochastic gradient descent and adaptive gradient methods

Add code
Aug 13, 2023
Viaarxiv icon

Efficient and Accurate Optimal Transport with Mirror Descent and Conjugate Gradients

Add code
Jul 17, 2023
Viaarxiv icon

Distributional Model Equivalence for Risk-Sensitive Reinforcement Learning

Add code
Jul 04, 2023
Viaarxiv icon