Picture for Yu Cheng

Yu Cheng

Native Hybrid Attention for Efficient Sequence Modeling

Add code
Oct 08, 2025
Figure 1 for Native Hybrid Attention for Efficient Sequence Modeling
Figure 2 for Native Hybrid Attention for Efficient Sequence Modeling
Figure 3 for Native Hybrid Attention for Efficient Sequence Modeling
Figure 4 for Native Hybrid Attention for Efficient Sequence Modeling
Viaarxiv icon

ExGRPO: Learning to Reason from Experience

Add code
Oct 02, 2025
Viaarxiv icon

Reasoning over Boundaries: Enhancing Specification Alignment via Test-time Delibration

Add code
Sep 18, 2025
Viaarxiv icon

HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?

Add code
Sep 10, 2025
Figure 1 for HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?
Figure 2 for HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?
Figure 3 for HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?
Figure 4 for HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?
Viaarxiv icon

Interleaving Reasoning for Better Text-to-Image Generation

Add code
Sep 09, 2025
Viaarxiv icon

Synthesizing Sheet Music Problems for Evaluation and Reinforcement Learning

Add code
Sep 04, 2025
Viaarxiv icon

Improved Personalized Headline Generation via Denoising Fake Interests from Implicit Feedback

Add code
Aug 10, 2025
Viaarxiv icon

SafeWork-R1: Coevolving Safety and Intelligence under the AI-45$^{\circ}$ Law

Add code
Jul 24, 2025
Viaarxiv icon

SeerAttention-R: Sparse Attention Adaptation for Long Reasoning

Add code
Jun 10, 2025
Figure 1 for SeerAttention-R: Sparse Attention Adaptation for Long Reasoning
Figure 2 for SeerAttention-R: Sparse Attention Adaptation for Long Reasoning
Figure 3 for SeerAttention-R: Sparse Attention Adaptation for Long Reasoning
Figure 4 for SeerAttention-R: Sparse Attention Adaptation for Long Reasoning
Viaarxiv icon

Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning

Add code
Jun 04, 2025
Figure 1 for Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning
Figure 2 for Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning
Figure 3 for Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning
Figure 4 for Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning
Viaarxiv icon