Picture for Han Zhao

Han Zhao

How Difficulty-Aware Staged Reinforcement Learning Enhances LLMs' Reasoning Capabilities: A Preliminary Experimental Study

Add code
Apr 01, 2025
Viaarxiv icon

Think Twice: Enhancing LLM Reasoning by Scaling Multi-round Test-time Thinking

Add code
Mar 25, 2025
Viaarxiv icon

1.4 Million Open-Source Distilled Reasoning Dataset to Empower Large Language Model Training

Add code
Mar 25, 2025
Viaarxiv icon

Predicting Potential Customer Support Needs and Optimizing Search Ranking in a Two-Sided Marketplace

Add code
Mar 21, 2025
Viaarxiv icon

Towards Scalable Foundation Model for Multi-modal and Hyperspectral Geospatial Data

Add code
Mar 17, 2025
Viaarxiv icon

MoRE: Unlocking Scalability in Reinforcement Learning for Quadruped Vision-Language-Action Models

Add code
Mar 11, 2025
Viaarxiv icon

Accelerating Vision-Language-Action Model Integrated with Action Chunking via Parallel Decoding

Add code
Mar 04, 2025
Viaarxiv icon

Structural Alignment Improves Graph Test-Time Adaptation

Add code
Feb 25, 2025
Viaarxiv icon

VLAS: Vision-Language-Action Model With Speech Instructions For Customized Robot Manipulation

Add code
Feb 19, 2025
Viaarxiv icon

Efficient Model Editing with Task Vector Bases: A Theoretical Framework and Scalable Approach

Add code
Feb 03, 2025
Viaarxiv icon