Picture for Hui Wang

Hui Wang

Queen's University Belfast, UK

FoleyGenEx: Unified Video-to-Audio Generation with Multi-Modal Control, Temporal Alignment, and Semantic Precision

Add code
Jun 12, 2026
Viaarxiv icon

Self-Guidance: Enhancing Neural Codecs via Decoder Manifold Alignment

Add code
Jun 11, 2026
Viaarxiv icon

Latent World Recovery for Multimodal Learning with Missing Modalities

Add code
Jun 10, 2026
Viaarxiv icon

To Be Multimodal or Not to Be: Query-Adaptive Audio-Visual Person Retrieval via Active Modality Detection

Add code
Jun 04, 2026
Viaarxiv icon

UAT: Unified Audio-Text Diffusion for Audio Generation, Editing, and Captioning

Add code
Jun 03, 2026
Viaarxiv icon

CardioLens: Revealing the Clinical Reality Gap of MLLMs via Multi-Sequence Cardiac MRI Evaluations

Add code
May 28, 2026
Viaarxiv icon

MeniOmni: A Structured Multimodal Benchmark for Holistic Meniscus Injury Assessment

Add code
May 27, 2026
Viaarxiv icon

EGL-SCA: Structural Credit Assignment for Co-Evolving Instructions and Tools in Graph Reasoning Agents

Add code
May 11, 2026
Viaarxiv icon

Safactory: A Scalable Agent Factory for Trustworthy Autonomous Intelligence

Add code
May 07, 2026
Viaarxiv icon

MolRecBench-Wild: A Real-World Benchmark for Optical Chemical Structure Recognition

Add code
May 07, 2026
Viaarxiv icon