Picture for Zhiwei He

Zhiwei He

Dancing with Critiques: Enhancing LLM Reasoning with Stepwise Natural Language Self-Critique

Add code
Mar 21, 2025
Viaarxiv icon

RaSA: Rank-Sharing Low-Rank Adaptation

Add code
Mar 16, 2025
Viaarxiv icon

The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reasoning Models

Add code
Mar 04, 2025
Viaarxiv icon

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Add code
Jan 30, 2025
Figure 1 for Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs
Figure 2 for Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs
Figure 3 for Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs
Figure 4 for Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs
Viaarxiv icon

Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs

Add code
Dec 30, 2024
Viaarxiv icon

Draft Model Knows When to Stop: A Self-Verification Length Policy for Speculative Decoding

Add code
Nov 27, 2024
Viaarxiv icon

Weak-to-Strong Preference Optimization: Stealing Reward from Weak Aligned Model

Add code
Oct 24, 2024
Viaarxiv icon

VoxelTrack: Exploring Voxel Representation for 3D Point Cloud Object Tracking

Add code
Aug 05, 2024
Viaarxiv icon

P2P: Part-to-Part Motion Cues Guide a Strong Tracking Framework for LiDAR Point Clouds

Add code
Jul 09, 2024
Viaarxiv icon

Evaluating Knowledge-based Cross-lingual Inconsistency in Large Language Models

Add code
Jul 01, 2024
Viaarxiv icon