
Ishaan Shah

Rethinking Reflection in Pre-Training

Apr 05, 2025
Via arXiv

DiVA-360: The Dynamic Visuo-Audio Dataset for Immersive Neural Fields

Jul 31, 2023
Via arXiv

Convergence of a Human-in-the-Loop Policy-Gradient Algorithm With Eligibility Trace Under Reward, Policy, and Advantage Feedback

Sep 15, 2021
Via arXiv