Picture for Yang Shi

Yang Shi

LongAV-Compass: Towards Unified Evaluation of Minute-Scale Audio-Visual Generation Across T2AV, I2AV, and V2AV

Add code
May 25, 2026
Viaarxiv icon

LatentOmni: Rethinking Omni-Modal Understanding via Unified Audio-Visual Latent Reasoning

Add code
May 21, 2026
Viaarxiv icon

Semantic Granularity Navigation in Image Editing

Add code
May 20, 2026
Viaarxiv icon

Beyond Rational Illusion: Behaviorally Realistic Strategic Classification

Add code
May 19, 2026
Viaarxiv icon

MSAVBench: Towards Comprehensive and Reliable Evaluation of Multi-Shot Audio-Video Generation

Add code
May 19, 2026
Viaarxiv icon

Artifact-Bench: Evaluating MLLMs on Detecting and Assessing the Artifacts of AI-Generated Videos

Add code
May 18, 2026
Viaarxiv icon

Edit-Compass & EditReward-Compass: A Unified Benchmark for Image Editing and Reward Modeling

Add code
May 13, 2026
Viaarxiv icon

Beyond the Last Layer: Multi-Layer Representation Fusion for Visual Tokenizatio

Add code
May 11, 2026
Viaarxiv icon

CCD-Level and Load-Aware Thread Orchestration for In-Memory Vector ANNS on Multi-Core CPUs

Add code
May 11, 2026
Viaarxiv icon

Majorization-Guided Test-Time Adaptation for Vision-Language Models under Modality-Specific Shift

Add code
Apr 27, 2026
Viaarxiv icon