Ziyi Yang

UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation

May 30, 2025
Via arXiv

QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

May 23, 2025
Via arXiv

ThinkSwitcher: When to Think Hard, When to Think Fast

May 20, 2025
Via arXiv

Beyond Task and Motion Planning: Hierarchical Robot Planning with General-Purpose Policies

Apr 24, 2025
Via arXiv

FuseRL: Dense Preference Optimization for Heterogeneous Model Fusion

Apr 09, 2025
Via arXiv

Scaling Laws of Synthetic Data for Language Models

Mar 26, 2025
Via arXiv

FuseChat-3.0: Preference Optimization Meets Heterogeneous Model Fusion

Mar 06, 2025
Via arXiv

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Mar 03, 2025
Via arXiv

Chain-of-Reasoning: Towards Unified Mathematical Reasoning in Large Language Models via a Multi-Paradigm Perspective

Jan 19, 2025
Via arXiv

Segmenting Text and Learning Their Rewards for Improved RLHF in Language Model

Jan 06, 2025
Via arXiv