Picture for Yue Zhang

Yue Zhang

Renmin University of China

EviProp: Seeded Relevance Diffusion on Chunk-Page Graphs for Long Multimodal Document Retrieval

Add code
Jun 08, 2026
Viaarxiv icon

UNIVID: Unified Vision-Language Model for Video Moderation

Add code
Jun 04, 2026
Viaarxiv icon

Beyond Absolute Scores: Relative Edit-induced Difference for Generalizable Image Aesthetic Assessment

Add code
Jun 04, 2026
Viaarxiv icon

PaddleOCR-VL-1.6: Expanding the Frontier of Document Parsing with Under-Optimized Region Refinement and Progressive Post-Training

Add code
Jun 02, 2026
Viaarxiv icon

Unified Video-Action Joint Denoising for Dexterous Action and Data Generation

Add code
Jun 02, 2026
Viaarxiv icon

Seeing Isn't Knowing: Do VLMs Know When Not to Answer Spatial Questions (and Why)?

Add code
May 28, 2026
Viaarxiv icon

Active Evidence-Seeking and Diagnostic Reasoning in Large Language Models for Clinical Decision Support

Add code
May 21, 2026
Viaarxiv icon

AiraXiv: An AI-Driven Open-Access Platform for Human and AI Scientists

Add code
May 20, 2026
Viaarxiv icon

On the Cost and Benefit of Chain of Thought: A Learning-Theoretic Perspective

Add code
May 20, 2026
Viaarxiv icon

PhyMotion: Structured 3D Motion Reward for Physics-Grounded Human Video Generation

Add code
May 14, 2026
Viaarxiv icon