Xuyang Shen

FlashSampling: Fast and Memory-Efficient Exact Sampling

Mar 16, 2026
Via arXiv

What Does Vision Tool-Use Reinforcement Learning Really Learn? Disentangling Tool-Induced and Intrinsic Effects for Crop-and-Zoom

Feb 01, 2026
Via arXiv

Elucidating the Design Space of Decay in Linear Attention

Sep 05, 2025
Via arXiv

Autoregressive Image Generation with Linear Complexity: A Spatial-Aware Decay Perspective

Jul 02, 2025
Via arXiv

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Jun 16, 2025
Via arXiv

One RL to See Them All: Visual Triple Unified Reinforcement Learning

May 23, 2025
Via arXiv

Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme

Apr 03, 2025
Via arXiv

Multi-modal Time Series Analysis: A Tutorial and Survey

Mar 17, 2025
Via arXiv

MiniMax-01: Scaling Foundation Models with Lightning Attention

Jan 14, 2025
Via arXiv

Scaling Laws for Linear Complexity Language Models

Jun 24, 2024
Via arXiv