Nathan Kallus

Causal Inference on Networks under Misspecified Exposure Mappings: A Partial Identification Framework

Feb 03, 2026
Via arXiv

Exploration in the Limit

Dec 31, 2025
Via arXiv

Efficient Inference for Inverse Reinforcement Learning and Dynamic Discrete Choice Models

Dec 30, 2025
Via arXiv

Stationary Reweighting Yields Local Convergence of Soft Fitted Q-Iteration

Dec 30, 2025
Via arXiv

Fitted Q Evaluation Without Bellman Completeness via Stationary Weighting

Dec 29, 2025
Via arXiv

Bellman Calibration for V-Learning in Offline Reinforcement Learning

Dec 29, 2025
Via arXiv

Semiparametric Preference Optimization: Your Language Model is Secretly a Single-Index Model

Dec 26, 2025
Via arXiv

The Value of Personalized Recommendations: Evidence from Netflix

Nov 11, 2025
Via arXiv

DiFFPO: Training Diffusion LLMs to Reason Fast and Furious via Reinforcement Learning

Oct 02, 2025
Via arXiv

Entropy After $\langle \texttt{/Think} \rangle$ for reasoning model early exiting

Sep 30, 2025
Via arXiv