Yan Ma

Visual Programmability: A Guide for Code-as-Thought in Chart Understanding

Sep 11, 2025
Via arXiv

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Jun 16, 2025
Via arXiv

Thinking with Generated Images

May 28, 2025
Via arXiv

One RL to See Them All: Visual Triple Unified Reinforcement Learning

May 23, 2025
Via arXiv

Generative AI Act II: Test Time Scaling Drives Cognition Engineering

Apr 21, 2025
Via arXiv

Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme

Apr 03, 2025
Via arXiv

Weak-to-Strong Reasoning

Jul 18, 2024
Via arXiv

ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation

Jul 08, 2024
Via arXiv

MMLongBench-Doc: Benchmarking Long-context Document Understanding with Visualizations

Jul 01, 2024
Via arXiv

OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI

Jun 18, 2024
Via arXiv