Picture for Xinming Wang

Xinming Wang

When Seeing Is Not Believing -- A Benchmark for Search-Grounded Video Misinformation Detection

Add code
Jun 02, 2026
Viaarxiv icon

WikiSeeker: Rethinking the Role of Vision-Language Models in Knowledge-Based Visual Question Answering

Add code
Apr 07, 2026
Viaarxiv icon

Omni IIE Bench: Benchmarking the Practical Capabilities of Image Editing Models

Add code
Mar 16, 2026
Viaarxiv icon

Towards Principled Dataset Distillation: A Spectral Distribution Perspective

Add code
Mar 02, 2026
Viaarxiv icon

Beyond Closed-Pool Video Retrieval: A Benchmark and Agent Framework for Real-World Video Search and Moment Localization

Add code
Feb 10, 2026
Viaarxiv icon

PaperX: A Unified Framework for Multimodal Academic Presentation Generation with Scholar DAG

Add code
Feb 05, 2026
Viaarxiv icon

ShotFinder: Imagination-Driven Open-Domain Video Shot Retrieval via Web Search

Add code
Jan 30, 2026
Viaarxiv icon

VDE Bench: Evaluating The Capability of Image Editing Models to Modify Visual Documents

Add code
Jan 27, 2026
Viaarxiv icon

RPO:Reinforcement Fine-Tuning with Partial Reasoning Optimization

Add code
Jan 27, 2026
Viaarxiv icon

TwinBrainVLA: Unleashing the Potential of Generalist VLMs for Embodied Tasks via Asymmetric Mixture-of-Transformers

Add code
Jan 20, 2026
Viaarxiv icon