Picture for Donghang Wu

Donghang Wu

Tony

Mind-Paced Speaking: A Dual-Brain Approach to Real-Time Reasoning in Spoken Language Models

Add code
Oct 10, 2025
Viaarxiv icon

Step-Audio 2 Technical Report

Add code
Jul 24, 2025
Viaarxiv icon

Cross-attention Inspired Selective State Space Models for Target Sound Extraction

Add code
Sep 10, 2024
Figure 1 for Cross-attention Inspired Selective State Space Models for Target Sound Extraction
Figure 2 for Cross-attention Inspired Selective State Space Models for Target Sound Extraction
Figure 3 for Cross-attention Inspired Selective State Space Models for Target Sound Extraction
Figure 4 for Cross-attention Inspired Selective State Space Models for Target Sound Extraction
Viaarxiv icon

Leveraging Moving Sound Source Trajectories for Universal Sound Separation

Add code
Sep 07, 2024
Figure 1 for Leveraging Moving Sound Source Trajectories for Universal Sound Separation
Figure 2 for Leveraging Moving Sound Source Trajectories for Universal Sound Separation
Figure 3 for Leveraging Moving Sound Source Trajectories for Universal Sound Separation
Figure 4 for Leveraging Moving Sound Source Trajectories for Universal Sound Separation
Viaarxiv icon