Picture for Hao Wang

Hao Wang

Xidian University, China

ShotFinder: Imagination-Driven Open-Domain Video Shot Retrieval via Web Search

Add code
Jan 30, 2026
Viaarxiv icon

On the Adversarial Robustness of Large Vision-Language Models under Visual Token Compression

Add code
Jan 29, 2026
Viaarxiv icon

When Gradient Optimization Is Not Enough: $\dagger$ Dispersive and Anchoring Geometric Regularizer for Multimodal Learning

Add code
Jan 29, 2026
Viaarxiv icon

RAW-Flow: Advancing RGB-to-RAW Image Reconstruction with Deterministic Latent Flow Matching

Add code
Jan 28, 2026
Viaarxiv icon

DEEPMED: Building a Medical DeepResearch Agent via Multi-hop Med-Search Data and Turn-Controlled Agentic Training & Inference

Add code
Jan 26, 2026
Viaarxiv icon

Spatial-Conditioned Reasoning in Long-Egocentric Videos

Add code
Jan 26, 2026
Viaarxiv icon

HyperWalker: Dynamic Hypergraph-Based Deep Diagnosis for Multi-Hop Clinical Modeling across EHR and X-Ray in Medical VLMs

Add code
Jan 20, 2026
Viaarxiv icon

A Unified Variational Imputation Framework for Electric Vehicle Charging Data Using Retrieval-Augmented Language Model

Add code
Jan 20, 2026
Viaarxiv icon

Be Your Own Red Teamer: Safety Alignment via Self-Play and Reflective Experience Replay

Add code
Jan 15, 2026
Viaarxiv icon

Motion Focus Recognition in Fast-Moving Egocentric Video

Add code
Jan 12, 2026
Viaarxiv icon