Yan Ma

What Does Vision Tool-Use Reinforcement Learning Really Learn? Disentangling Tool-Induced and Intrinsic Effects for Crop-and-Zoom

Feb 01, 2026
Via arXiv

Adaptively trained Physics-informed Radial Basis Function Neural Networks for Solving Multi-asset Option Pricing Problems

Jan 19, 2026
Via arXiv

Visual Programmability: A Guide for Code-as-Thought in Chart Understanding

Sep 11, 2025
Via arXiv

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Jun 16, 2025
Via arXiv

Thinking with Generated Images

May 28, 2025
Via arXiv

One RL to See Them All: Visual Triple Unified Reinforcement Learning

May 23, 2025
Via arXiv

Generative AI Act II: Test Time Scaling Drives Cognition Engineering

Apr 21, 2025
Via arXiv

Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme

Apr 03, 2025
Via arXiv

Weak-to-Strong Reasoning

Jul 18, 2024
Via arXiv

ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation

Jul 08, 2024
Via arXiv