Ning Miao

Beyond Ideal Instruction: A Comprehensive Framework for Evaluating LLMs in Realistic Interactions

Jun 03, 2026
Via arXiv

TD-Grokking: Learning from Zero-Reward Problems by Training-Time Decomposition

Jun 03, 2026
Via arXiv

STT-Arena: A More Realistic Environment for Tool-Using with Spatio-Temporal Dynamics

May 18, 2026
Via arXiv

Verifier-Backed Hard Problem Generation for Mathematical Reasoning

May 07, 2026
Via arXiv

Step-Level Sparse Autoencoder for Reasoning Process Interpretation

Mar 03, 2026
Via arXiv

SkillCraft: Can LLM Agents Learn to Use Tools Skillfully?

Feb 28, 2026
Via arXiv

Not All Steps are Informative: On the Linearity of LLMs' RLVR Training

Jan 08, 2026
Via arXiv

Enhancing Large Language Model Reasoning with Reward Models: An Analytical Survey

Oct 02, 2025
Via arXiv

BED-LLM: Intelligent Information Gathering with LLMs and Bayesian Experimental Design

Aug 28, 2025
Via arXiv

SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning

Aug 02, 2023
Via arXiv