Picture for Dhruv Rohatgi

Dhruv Rohatgi

Is a Good Foundation Necessary for Efficient Reinforcement Learning? The Computational Role of the Base Model in Exploration

Add code
Mar 10, 2025
Viaarxiv icon

Computational-Statistical Tradeoffs at the Next-Token Prediction Barrier: Autoregressive and Imitation Learning under Misspecification

Add code
Feb 18, 2025
Viaarxiv icon

Necessary and Sufficient Oracles: Toward a Computational Taxonomy For Reinforcement Learning

Add code
Feb 12, 2025
Viaarxiv icon

Self-Improvement in Language Models: The Sharpening Mechanism

Add code
Dec 02, 2024
Viaarxiv icon

Towards characterizing the value of edge embeddings in Graph Neural Networks

Add code
Oct 13, 2024
Viaarxiv icon

Online Control in Population Dynamics

Add code
Jun 03, 2024
Viaarxiv icon

Exploration is Harder than Prediction: Cryptographically Separating Reinforcement Learning from Supervised Learning

Add code
Apr 04, 2024
Viaarxiv icon

Lasso with Latents: Efficient Estimation, Covariate Rescaling, and Computational-Statistical Gaps

Add code
Feb 23, 2024
Figure 1 for Lasso with Latents: Efficient Estimation, Covariate Rescaling, and Computational-Statistical Gaps
Figure 2 for Lasso with Latents: Efficient Estimation, Covariate Rescaling, and Computational-Statistical Gaps
Viaarxiv icon

Exploring and Learning in Sparse Linear MDPs without Computationally Intractable Oracles

Add code
Sep 19, 2023
Viaarxiv icon

Provable benefits of score matching

Add code
Jun 03, 2023
Viaarxiv icon