Jianye Hao

Short Chains, Deep Thoughts: Balancing Reasoning Efficiency and Intra-Segment Capability via Split-Merge Optimization

Feb 03, 2026
Via arXiv

Why Attention Patterns Exist: A Unifying Temporal Perspective Analysis

Jan 29, 2026
Via arXiv

Ratio-Variance Regularized Policy Optimization for Efficient LLM Fine-tuning

Jan 06, 2026
Via arXiv

Enhancing the Medical Context-Awareness Ability of LLMs via Multifaceted Self-Refinement Learning

Nov 14, 2025
Via arXiv

Hi-Agent: Hierarchical Vision-Language Agents for Mobile Device Control

Oct 16, 2025
Via arXiv

Embodied Arena: A Comprehensive, Unified, and Evolving Evaluation Platform for Embodied AI

Sep 18, 2025
Via arXiv

OmniEVA: Embodied Versatile Planner via Task-Adaptive 3D-Grounded and Embodiment-aware Reasoning

Sep 11, 2025
Via arXiv

Uncertainty-quantified Rollout Policy Adaptation for Unlabelled Cross-domain Temporal Grounding

Aug 08, 2025
Via arXiv

Squeeze the Soaked Sponge: Efficient Off-policy Reinforcement Finetuning for Large Language Model

Jul 09, 2025
Via arXiv

Reasoning on a Budget: A Survey of Adaptive and Controllable Test-Time Compute in LLMs

Jul 02, 2025
Via arXiv