Xiaotian Han

AgentCE-Bench: Agent Configurable Evaluation with Scalable Horizons and Controllable Difficulty under Lightweight Environments

Apr 10, 2026
Via arXiv

Demystifying OPD: Length Inflation and Stabilization Strategies for Large Language Models

Apr 09, 2026
Via arXiv

ACE-Bench: Agent Configurable Evaluation with Scalable Horizons and Controllable Difficulty under Lightweight Environments

Apr 07, 2026
Via arXiv

Rethinking Exploration in RLVR: From Entropy Regularization to Refinement via Bidirectional Entropy Modulation

Apr 06, 2026
Via arXiv

ERNIE 5.0 Technical Report

Feb 04, 2026
Via arXiv

When Domains Interact: Asymmetric and Order-Sensitive Cross-Domain Effects in Reinforcement Learning for Reasoning

Feb 01, 2026
Via arXiv

Mid-Think: Training-Free Intermediate-Budget Reasoning via Token-Level Triggers

Jan 11, 2026
Via arXiv

Demystifying Hybrid Thinking: Can LLMs Truly Switch Between Think and No-Think?

Oct 14, 2025
Via arXiv

Quantized but Deceptive? A Multi-Dimensional Truthfulness Evaluation of Quantized LLMs

Aug 26, 2025
Via arXiv

When Truthful Representations Flip Under Deceptive Instructions?

Jul 29, 2025
Via arXiv