Jun Gao

NVIDIA, University of Toronto, Vector Institute

Xuanwu: Evolving General Multimodal Models into an Industrial-Grade Foundation for Content Ecosystems

Mar 31, 2026, via arXiv

StuPASE: Towards Low-Hallucination Studio-Quality Generative Speech Enhancement

Mar 10, 2026, via arXiv

AutoQRA: Joint Optimization of Mixed-Precision Quantization and Low-rank Adapters for Efficient LLM Fine-Tuning

Feb 25, 2026, via arXiv

Amber-Image: Efficient Compression of Large-Scale Diffusion Transformers

Feb 19, 2026, via arXiv

Improving Medical Visual Reinforcement Fine-Tuning via Perception and Reasoning Augmentation

Feb 11, 2026, via arXiv

SoulX-FlashHead: Oracle-guided Generation of Infinite Real-time Streaming Talking Heads

Feb 07, 2026, via arXiv

HUMANLLM: Benchmarking and Reinforcing LLM Anthropomorphism via Human Cognitive Patterns

Jan 15, 2026, via arXiv

Motion Attribution for Video Generation

Jan 13, 2026, via arXiv

The Semantic Architect: How FEAML Bridges Structured Data and LLMs for Multi-Label Tasks

Dec 17, 2025, via arXiv

ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation

Oct 05, 2025, via arXiv