Picture for Jiaxin Ai

Jiaxin Ai

Dialogue as Discovery: Navigating Human Intent Through Principled Inquiry

Add code
Oct 31, 2025
Viaarxiv icon

From Pixels to Paths: A Multi-Agent Framework for Editable Scientific Illustration

Add code
Oct 31, 2025
Viaarxiv icon

MDK12-Bench: A Comprehensive Evaluation of Multimodal Large Language Models on Multidisciplinary Exams

Add code
Aug 09, 2025
Viaarxiv icon

Sekai: A Video Dataset towards World Exploration

Add code
Jun 18, 2025
Viaarxiv icon

A High-Quality Dataset and Reliable Evaluation for Interleaved Image-Text Generation

Add code
Jun 11, 2025
Viaarxiv icon

SridBench: Benchmark of Scientific Research Illustration Drawing of Image Generation Model

Add code
May 28, 2025
Viaarxiv icon

MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models

Add code
Apr 08, 2025
Viaarxiv icon

MPBench: A Comprehensive Multimodal Reasoning Benchmark for Process Errors Identification

Add code
Mar 16, 2025
Figure 1 for MPBench: A Comprehensive Multimodal Reasoning Benchmark for Process Errors Identification
Figure 2 for MPBench: A Comprehensive Multimodal Reasoning Benchmark for Process Errors Identification
Figure 3 for MPBench: A Comprehensive Multimodal Reasoning Benchmark for Process Errors Identification
Figure 4 for MPBench: A Comprehensive Multimodal Reasoning Benchmark for Process Errors Identification
Viaarxiv icon

PEBench: A Fictitious Dataset to Benchmark Machine Unlearning for Multimodal Large Language Models

Add code
Mar 16, 2025
Figure 1 for PEBench: A Fictitious Dataset to Benchmark Machine Unlearning for Multimodal Large Language Models
Figure 2 for PEBench: A Fictitious Dataset to Benchmark Machine Unlearning for Multimodal Large Language Models
Figure 3 for PEBench: A Fictitious Dataset to Benchmark Machine Unlearning for Multimodal Large Language Models
Figure 4 for PEBench: A Fictitious Dataset to Benchmark Machine Unlearning for Multimodal Large Language Models
Viaarxiv icon

ARMOR v0.1: Empowering Autoregressive Multimodal Understanding Model with Interleaved Multimodal Generation via Asymmetric Synergy

Add code
Mar 09, 2025
Figure 1 for ARMOR v0.1: Empowering Autoregressive Multimodal Understanding Model with Interleaved Multimodal Generation via Asymmetric Synergy
Figure 2 for ARMOR v0.1: Empowering Autoregressive Multimodal Understanding Model with Interleaved Multimodal Generation via Asymmetric Synergy
Figure 3 for ARMOR v0.1: Empowering Autoregressive Multimodal Understanding Model with Interleaved Multimodal Generation via Asymmetric Synergy
Figure 4 for ARMOR v0.1: Empowering Autoregressive Multimodal Understanding Model with Interleaved Multimodal Generation via Asymmetric Synergy
Viaarxiv icon