Yu Cheng

Query as Anchor: Scenario-Adaptive User Representation via Large Language Model

Feb 17, 2026
Via arXiv

ExpertWeaver: Unlocking the Inherent MoE in Dense LLMs with GLU Activation Patterns

Feb 17, 2026
Via arXiv

HiFloat4 Format for Language Model Inference

Feb 13, 2026
Via arXiv

Look Inward to Explore Outward: Learning Temperature Policy from LLM Internal States via Hierarchical RL

Feb 13, 2026
Via arXiv

Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning

Feb 12, 2026
Via arXiv

How Do Decoder-Only LLMs Perceive Users? Rethinking Attention Masking for User Representation Learning

Feb 11, 2026
Via arXiv

P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads

Feb 10, 2026
Via arXiv

New Skills or Sharper Primitives? A Probabilistic Perspective on the Emergence of Reasoning in RLVR

Feb 09, 2026
Via arXiv

Characterizing, Evaluating, and Optimizing Complex Reasoning

Feb 09, 2026
Via arXiv

Affordance-Aware Interactive Decision-Making and Execution for Ambiguous Instructions

Feb 05, 2026
Via arXiv