Picture for Abhishek Gupta

Abhishek Gupta

BCG Henderson Institute, Montreal AI Ethics Institute, and, Boston Consulting Group

TMRL: Diffusion Timestep-Modulated Pretraining Enables Exploration for Efficient Policy Finetuning

Add code
May 12, 2026
Viaarxiv icon

Learning to Compress Time-to-Control: A Reinforcement Learning Framework for Chronic Disease Management

Add code
May 10, 2026
Viaarxiv icon

OGPO: Sample Efficient Full-Finetuning of Generative Control Policies

Add code
May 04, 2026
Viaarxiv icon

Transferable Physics-Informed Representations via Closed-Form Head Adaptation

Add code
Apr 23, 2026
Viaarxiv icon

Operator-Theoretic Foundations and Policy Gradient Methods for General MDPs with Unbounded Costs

Add code
Mar 18, 2026
Viaarxiv icon

Simulation Distillation: Pretraining World Models in Simulation for Rapid Real-World Adaptation

Add code
Mar 16, 2026
Viaarxiv icon

Emergent Dexterity via Diverse Resets and Large-Scale Reinforcement Learning

Add code
Mar 16, 2026
Viaarxiv icon

Robometer: Scaling General-Purpose Robotic Reward Models via Trajectory Comparisons

Add code
Mar 02, 2026
Viaarxiv icon

SPARR: Simulation-based Policies with Asymmetric Real-world Residuals for Assembly

Add code
Feb 26, 2026
Viaarxiv icon

RFS: Reinforcement learning with Residual flow steering for dexterous manipulation

Add code
Feb 03, 2026
Viaarxiv icon