Picture for Kefan Dong

Kefan Dong

Formal Theorem Proving by Rewarding LLMs to Decompose Proofs Hierarchically

Add code
Nov 04, 2024
Viaarxiv icon

Beyond NTK with Vanilla Gradient Descent: A Mean-Field Analysis of Neural Networks with Polynomial Width, Samples, and Time

Add code
Jun 28, 2023
Viaarxiv icon

Toward $L_\infty$-recovery of Nonlinear Functions: A Polynomial Sample Complexity Bound for Gaussian Random Fields

Add code
Apr 29, 2023
Viaarxiv icon

Model-based Offline Reinforcement Learning with Local Misspecification

Add code
Jan 26, 2023
Viaarxiv icon

First Steps Toward Understanding the Extrapolation of Nonlinear Models to Unseen Domains

Add code
Dec 01, 2022
Viaarxiv icon

Asymptotic Instance-Optimal Algorithms for Interactive Decision Making

Add code
Jun 06, 2022
Viaarxiv icon

Design of Experiments for Stochastic Contextual Linear Bandits

Add code
Jul 22, 2021
Figure 1 for Design of Experiments for Stochastic Contextual Linear Bandits
Figure 2 for Design of Experiments for Stochastic Contextual Linear Bandits
Figure 3 for Design of Experiments for Stochastic Contextual Linear Bandits
Figure 4 for Design of Experiments for Stochastic Contextual Linear Bandits
Viaarxiv icon

Provable Model-based Nonlinear Bandit and Reinforcement Learning: Shelve Optimism, Embrace Virtual Curvature

Add code
Feb 08, 2021
Figure 1 for Provable Model-based Nonlinear Bandit and Reinforcement Learning: Shelve Optimism, Embrace Virtual Curvature
Viaarxiv icon

Refined Analysis of FPL for Adversarial Markov Decision Processes

Add code
Aug 21, 2020
Figure 1 for Refined Analysis of FPL for Adversarial Markov Decision Processes
Viaarxiv icon

Multinomial Logit Bandit with Low Switching Cost

Add code
Jul 09, 2020
Viaarxiv icon