Picture for Haoqian Wang

Haoqian Wang

When Models Judge Themselves: Unsupervised Self-Evolution for Multimodal Reasoning

Add code
Mar 22, 2026
Viaarxiv icon

Efficient Long-Horizon GUI Agents via Training-Free KV Cache Compression

Add code
Feb 27, 2026
Viaarxiv icon

R3G: A Reasoning--Retrieval--Reranking Framework for Vision-Centric Answer Generation

Add code
Jan 25, 2026
Viaarxiv icon

Language-Guided and Motion-Aware Gait Representation for Generalizable Recognition

Add code
Jan 17, 2026
Viaarxiv icon

Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation

Add code
Dec 30, 2025
Viaarxiv icon

EchoMotion: Unified Human Video and Motion Generation via Dual-Modality Diffusion Transformer

Add code
Dec 21, 2025
Viaarxiv icon

Measuring and Controlling the Spectral Bias for Self-Supervised Image Denoising

Add code
Oct 01, 2025
Viaarxiv icon

DreamSwapV: Mask-guided Subject Swapping for Any Customized Video Editing

Add code
Aug 20, 2025
Viaarxiv icon

MeshCoder: LLM-Powered Structured Mesh Code Generation from Point Clouds

Add code
Aug 20, 2025
Viaarxiv icon

Make Your MoVe: Make Your 3D Contents by Adapting Multi-View Diffusion Models to External Editing

Add code
Aug 11, 2025
Viaarxiv icon