Picture for Jun Gao

Jun Gao

NVIDIA, University of Toronto, Vector Institute

UniPASE: A Generative Model for Universal Speech Enhancement with High Fidelity and Low Hallucinations

Add code
Apr 16, 2026
Viaarxiv icon

Lyra 2.0: Explorable Generative 3D Worlds

Add code
Apr 14, 2026
Viaarxiv icon

Making MLLMs Blind: Adversarial Smuggling Attacks in MLLM Content Moderation

Add code
Apr 09, 2026
Viaarxiv icon

MoRight: Motion Control Done Right

Add code
Apr 08, 2026
Viaarxiv icon

Beyond Semantic Search: Towards Referential Anchoring in Composed Image Retrieval

Add code
Apr 07, 2026
Viaarxiv icon

Xuanwu: Evolving General Multimodal Models into an Industrial-Grade Foundation for Content Ecosystems

Add code
Mar 31, 2026
Viaarxiv icon

StuPASE: Towards Low-Hallucination Studio-Quality Generative Speech Enhancement

Add code
Mar 10, 2026
Viaarxiv icon

AutoQRA: Joint Optimization of Mixed-Precision Quantization and Low-rank Adapters for Efficient LLM Fine-Tuning

Add code
Feb 25, 2026
Viaarxiv icon

Amber-Image: Efficient Compression of Large-Scale Diffusion Transformers

Add code
Feb 19, 2026
Viaarxiv icon

Improving Medical Visual Reinforcement Fine-Tuning via Perception and Reasoning Augmentation

Add code
Feb 11, 2026
Viaarxiv icon