Shizhu He

The Zero-Step Thinking: An Empirical Study of Mode Selection as Harder Early Exit in Reasoning Models

Oct 22, 2025
Via arXiv

Towards Agentic Self-Learning LLMs in Search Environment

Oct 16, 2025
Via arXiv

SparK: Query-Aware Unstructured Sparsity with Recoverable KV Cache Channel Pruning

Aug 21, 2025
Via arXiv

Socratic-PRMBench: Benchmarking Process Reward Models with Systematic Reasoning Patterns

May 29, 2025
Via arXiv

Amplify Adjacent Token Differences: Enhancing Long Chain-of-Thought Reasoning with Shift-FFN

May 22, 2025
Via arXiv

Beyond Hard and Soft: Hybrid Context Compression for Balancing Local and Global Information Retention

May 21, 2025
Via arXiv

Neural Incompatibility: The Unbridgeable Gap of Cross-Scale Parametric Knowledge Transfer in Large Language Models

May 20, 2025
Via arXiv

Better wit than wealth: Dynamic Parametric Retrieval Augmented Generation for Test-time Knowledge Enhancement

Mar 31, 2025
Via arXiv

Probabilistic Uncertain Reward Model: A Natural Generalization of Bradley-Terry Reward Model

Mar 28, 2025
Via arXiv

GATE: Graph-based Adaptive Tool Evolution Across Diverse Tasks

Feb 20, 2025
Via arXiv