Hui Wang

Queen's University Belfast, UK

To Be Multimodal or Not to Be: Query-Adaptive Audio-Visual Person Retrieval via Active Modality Detection

Jun 04, 2026
Via arXiv

UAT: Unified Audio-Text Diffusion for Audio Generation, Editing, and Captioning

Jun 03, 2026
Via arXiv

CardioLens: Revealing the Clinical Reality Gap of MLLMs via Multi-Sequence Cardiac MRI Evaluations

May 28, 2026
Via arXiv

MeniOmni: A Structured Multimodal Benchmark for Holistic Meniscus Injury Assessment

May 27, 2026
Via arXiv

EGL-SCA: Structural Credit Assignment for Co-Evolving Instructions and Tools in Graph Reasoning Agents

May 11, 2026
Via arXiv

MolRecBench-Wild: A Real-World Benchmark for Optical Chemical Structure Recognition

May 07, 2026
Via arXiv

Safactory: A Scalable Agent Factory for Trustworthy Autonomous Intelligence

May 07, 2026
Via arXiv

Kwai Summary Attention Technical Report

Apr 27, 2026
Via arXiv

Autonomous UAV Pipeline Near-proximity Inspection via Disturbance-Aware Predictive Visual Servoing

Apr 21, 2026
Via arXiv

OSC: Hardware Efficient W4A4 Quantization via Outlier Separation in Channel Dimension

Apr 14, 2026
Via arXiv