Picture for Ruiqi Zhang

Ruiqi Zhang

SPEED-RL: Faster Training of Reasoning Models via Online Curriculum Learning

Add code
Jun 10, 2025
Viaarxiv icon

Long or short CoT? Investigating Instance-level Switch of Large Reasoning Models

Add code
Jun 04, 2025
Viaarxiv icon

MLMC-based Resource Adequacy Assessment with Active Learning Trained Surrogate Models

Add code
May 27, 2025
Viaarxiv icon

Duawlfin: A Drone with Unified Actuation for Wheeled Locomotion and Flight Operation

Add code
May 20, 2025
Viaarxiv icon

Minimax Optimal Convergence of Gradient Descent in Logistic Regression via Large and Adaptive Stepsizes

Add code
Apr 05, 2025
Viaarxiv icon

Mitigating Ambiguities in 3D Classification with Gaussian Splatting

Add code
Mar 11, 2025
Viaarxiv icon

How Do LLMs Perform Two-Hop Reasoning in Context?

Add code
Feb 19, 2025
Viaarxiv icon

Fast Best-of-N Decoding via Speculative Rejection

Add code
Oct 26, 2024
Viaarxiv icon

Simplicity Prevails: Rethinking Negative Preference Optimization for LLM Unlearning

Add code
Oct 09, 2024
Viaarxiv icon

Negative Preference Optimization: From Catastrophic Collapse to Effective Unlearning

Add code
Apr 08, 2024
Viaarxiv icon