Picture for Yuanxing Zhang

Yuanxing Zhang

CoVEBench: Can Video Editing Models Handle Complex Instructions?

Add code
Jun 07, 2026
Viaarxiv icon

OmniCap-IF: Benchmarking and Improving Instruction Following Abilities for Omni-Video Captioning

Add code
Jun 07, 2026
Viaarxiv icon

MMG2Skill: Can Agents Distill In-the-Wild Guides into Self-Evolving Skills?

Add code
Jun 01, 2026
Viaarxiv icon

LongAV-Compass: Towards Unified Evaluation of Minute-Scale Audio-Visual Generation Across T2AV, I2AV, and V2AV

Add code
May 25, 2026
Viaarxiv icon

LatentOmni: Rethinking Omni-Modal Understanding via Unified Audio-Visual Latent Reasoning

Add code
May 21, 2026
Viaarxiv icon

Beyond Rational Illusion: Behaviorally Realistic Strategic Classification

Add code
May 19, 2026
Viaarxiv icon

Artifact-Bench: Evaluating MLLMs on Detecting and Assessing the Artifacts of AI-Generated Videos

Add code
May 18, 2026
Viaarxiv icon

Edit-Compass & EditReward-Compass: A Unified Benchmark for Image Editing and Reward Modeling

Add code
May 13, 2026
Viaarxiv icon

Beyond the Last Layer: Multi-Layer Representation Fusion for Visual Tokenizatio

Add code
May 11, 2026
Viaarxiv icon

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models

Add code
Apr 06, 2026
Viaarxiv icon