Picture for Joon Son Chung

Joon Son Chung

SpeakerLLM: A Speaker-Specialized Audio-LLM for Speaker Understanding and Verification Reasoning

Add code
May 14, 2026
Viaarxiv icon

FiTS: Interpretable Spiking Neurons via Frequency Selectivity and Temporal Shaping

Add code
May 13, 2026
Viaarxiv icon

Keep What Audio Cannot Say: Context-Preserving Token Pruning for Omni-LLMs

Add code
May 12, 2026
Viaarxiv icon

Probing Cross-modal Information Hubs in Audio-Visual LLMs

Add code
May 11, 2026
Viaarxiv icon

Seeing Through Touch: Tactile-Driven Visual Localization of Material Regions

Add code
Apr 13, 2026
Viaarxiv icon

Cinematic Audio Source Separation Using Visual Cues

Add code
Mar 27, 2026
Viaarxiv icon

Plug-and-Steer: Decoupling Separation and Selection in Audio-Visual Target Speaker Extraction

Add code
Mar 20, 2026
Viaarxiv icon

On the Nature of Attention Sink that Shapes Decoding Strategy in MLLMs

Add code
Mar 15, 2026
Viaarxiv icon

MamTra: A Hybrid Mamba-Transformer Backbone for Speech Synthesis

Add code
Mar 12, 2026
Viaarxiv icon

FastAV: Efficient Token Pruning for Audio-Visual Large Language Model Inference

Add code
Jan 19, 2026
Viaarxiv icon