Picture for Zhifang Sui

Zhifang Sui

CoLT: Reasoning with Chain of Latent Tool Calls

Add code
Feb 04, 2026
Viaarxiv icon

Decoding in Geometry: Alleviating Embedding-Space Crowding for Complex Reasoning

Add code
Jan 30, 2026
Viaarxiv icon

TeachBench: A Syllabus-Grounded Framework for Evaluating Teaching Ability in Large Language Models

Add code
Jan 29, 2026
Viaarxiv icon

GroundingME: Exposing the Visual Grounding Gap in MLLMs through Multi-Dimensional Evaluation

Add code
Dec 19, 2025
Viaarxiv icon

Towards Stable and Effective Reinforcement Learning for Mixture-of-Experts

Add code
Oct 27, 2025
Viaarxiv icon

LLM-REVal: Can We Trust LLM Reviewers Yet?

Add code
Oct 14, 2025
Figure 1 for LLM-REVal: Can We Trust LLM Reviewers Yet?
Figure 2 for LLM-REVal: Can We Trust LLM Reviewers Yet?
Figure 3 for LLM-REVal: Can We Trust LLM Reviewers Yet?
Figure 4 for LLM-REVal: Can We Trust LLM Reviewers Yet?
Viaarxiv icon

Reinforcement Pre-Training

Add code
Jun 09, 2025
Figure 1 for Reinforcement Pre-Training
Figure 2 for Reinforcement Pre-Training
Figure 3 for Reinforcement Pre-Training
Figure 4 for Reinforcement Pre-Training
Viaarxiv icon

HauntAttack: When Attack Follows Reasoning as a Shadow

Add code
Jun 08, 2025
Figure 1 for HauntAttack: When Attack Follows Reasoning as a Shadow
Figure 2 for HauntAttack: When Attack Follows Reasoning as a Shadow
Figure 3 for HauntAttack: When Attack Follows Reasoning as a Shadow
Figure 4 for HauntAttack: When Attack Follows Reasoning as a Shadow
Viaarxiv icon

Towards Harmonized Uncertainty Estimation for Large Language Models

Add code
May 25, 2025
Viaarxiv icon

RICo: Refined In-Context Contribution for Automatic Instruction-Tuning Data Selection

Add code
May 18, 2025
Viaarxiv icon