Picture for Chao-Han Huck Yang

Chao-Han Huck Yang

Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music

Add code
Apr 13, 2026
Viaarxiv icon

How Auditory Knowledge in LLM Backbones Shapes Audio Language Models: A Holistic Evaluation

Add code
Mar 19, 2026
Viaarxiv icon

The Interspeech 2026 Audio Reasoning Challenge: Evaluating Reasoning Process Quality for Audio Reasoning Models and Agents

Add code
Feb 15, 2026
Viaarxiv icon

Bagpiper: Solving Open-Ended Audio Tasks via Rich Captions

Add code
Feb 05, 2026
Viaarxiv icon

Quantum LEGO Learning: A Modular Design Principle for Hybrid Artificial Intelligence

Add code
Jan 29, 2026
Viaarxiv icon

PRiSM: Benchmarking Phone Realization in Speech Models

Add code
Jan 20, 2026
Viaarxiv icon

Speech-Hands: A Self-Reflection Voice Agentic Approach to Speech Recognition and Audio Reasoning with Omni Perception

Add code
Jan 14, 2026
Viaarxiv icon

Continual Quantum Architecture Search with Tensor-Train Encoding: Theory and Applications to Signal Processing

Add code
Jan 10, 2026
Viaarxiv icon

Random-Matrix-Induced Simplicity Bias in Over-parameterized Variational Quantum Circuits

Add code
Jan 05, 2026
Viaarxiv icon

Long Grounded Thoughts: Distilling Compositional Visual Reasoning Chains at Scale

Add code
Nov 07, 2025
Viaarxiv icon