Picture for Yinglun Zhu

Yinglun Zhu

Closing the Reflection Gap: A Free Calibration Bonus for Agentic RL

Add code
Jun 12, 2026
Viaarxiv icon

Active Testing of Large Language Models via Approximate Neyman Allocation

Add code
May 11, 2026
Viaarxiv icon

Adaptive Test-Time Compute Allocation with Evolving In-Context Demonstrations

Add code
Apr 22, 2026
Viaarxiv icon

Online Finetuning Decision Transformers with Pure RL Gradients

Add code
Jan 01, 2026
Viaarxiv icon

Interactive Machine Learning: From Theory to Scale

Add code
Dec 30, 2025
Viaarxiv icon

Strategic Scaling of Test-Time Compute: A Bandit Learning Approach

Add code
Jun 15, 2025
Viaarxiv icon

Mixtraining: A Better Trade-Off Between Compute and Performance

Add code
Feb 26, 2025
Figure 1 for Mixtraining: A Better Trade-Off Between Compute and Performance
Figure 2 for Mixtraining: A Better Trade-Off Between Compute and Performance
Figure 3 for Mixtraining: A Better Trade-Off Between Compute and Performance
Figure 4 for Mixtraining: A Better Trade-Off Between Compute and Performance
Viaarxiv icon

Efficient Sparse PCA via Block-Diagonalization

Add code
Oct 18, 2024
Figure 1 for Efficient Sparse PCA via Block-Diagonalization
Figure 2 for Efficient Sparse PCA via Block-Diagonalization
Figure 3 for Efficient Sparse PCA via Block-Diagonalization
Figure 4 for Efficient Sparse PCA via Block-Diagonalization
Viaarxiv icon

Efficient Sequential Decision Making with Large Language Models

Add code
Jun 17, 2024
Figure 1 for Efficient Sequential Decision Making with Large Language Models
Figure 2 for Efficient Sequential Decision Making with Large Language Models
Figure 3 for Efficient Sequential Decision Making with Large Language Models
Figure 4 for Efficient Sequential Decision Making with Large Language Models
Viaarxiv icon

An Experimental Design Framework for Label-Efficient Supervised Finetuning of Large Language Models

Add code
Jan 12, 2024
Figure 1 for An Experimental Design Framework for Label-Efficient Supervised Finetuning of Large Language Models
Figure 2 for An Experimental Design Framework for Label-Efficient Supervised Finetuning of Large Language Models
Figure 3 for An Experimental Design Framework for Label-Efficient Supervised Finetuning of Large Language Models
Figure 4 for An Experimental Design Framework for Label-Efficient Supervised Finetuning of Large Language Models
Viaarxiv icon