Picture for Shagun Sodhani

Shagun Sodhani

Effect of Document Packing on the Latent Multi-Hop Reasoning Capabilities of Large Language Models

Add code
Dec 16, 2025
Figure 1 for Effect of Document Packing on the Latent Multi-Hop Reasoning Capabilities of Large Language Models
Figure 2 for Effect of Document Packing on the Latent Multi-Hop Reasoning Capabilities of Large Language Models
Figure 3 for Effect of Document Packing on the Latent Multi-Hop Reasoning Capabilities of Large Language Models
Figure 4 for Effect of Document Packing on the Latent Multi-Hop Reasoning Capabilities of Large Language Models
Viaarxiv icon

Scaling and Distilling Transformer Models for sEMG

Add code
Jul 29, 2025
Viaarxiv icon

AI Research Agents for Machine Learning: Search, Exploration, and Generalization in MLE-bench

Add code
Jul 03, 2025
Viaarxiv icon

Do Large Language Models Know How Much They Know?

Add code
Feb 26, 2025
Viaarxiv icon

MaestroMotif: Skill Design from Artificial Intelligence Feedback

Add code
Dec 11, 2024
Figure 1 for MaestroMotif: Skill Design from Artificial Intelligence Feedback
Figure 2 for MaestroMotif: Skill Design from Artificial Intelligence Feedback
Figure 3 for MaestroMotif: Skill Design from Artificial Intelligence Feedback
Figure 4 for MaestroMotif: Skill Design from Artificial Intelligence Feedback
Viaarxiv icon

EpiK-Eval: Evaluation for Language Models as Epistemic Models

Add code
Oct 23, 2023
Viaarxiv icon

Motif: Intrinsic Motivation from Artificial Intelligence Feedback

Add code
Sep 29, 2023
Figure 1 for Motif: Intrinsic Motivation from Artificial Intelligence Feedback
Figure 2 for Motif: Intrinsic Motivation from Artificial Intelligence Feedback
Figure 3 for Motif: Intrinsic Motivation from Artificial Intelligence Feedback
Figure 4 for Motif: Intrinsic Motivation from Artificial Intelligence Feedback
Viaarxiv icon

TorchRL: A data-driven decision-making library for PyTorch

Add code
Jun 01, 2023
Viaarxiv icon

Sequence Modeling is a Robust Contender for Offline Reinforcement Learning

Add code
May 26, 2023
Viaarxiv icon

VIP: Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training

Add code
Sep 30, 2022
Figure 1 for VIP: Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training
Figure 2 for VIP: Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training
Figure 3 for VIP: Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training
Figure 4 for VIP: Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training
Viaarxiv icon