Picture for Alexander H. Liu

Alexander H. Liu

Overcoming State Inertia in Full-Duplex Spoken Language Models via Activation Steering

Add code
Jun 09, 2026
Viaarxiv icon

USAD 2.0: Scaling Representation Distillation for Universal Audio Understanding

Add code
Jun 04, 2026
Viaarxiv icon

Voxtral TTS

Add code
Mar 26, 2026
Viaarxiv icon

Voxtral Realtime

Add code
Feb 11, 2026
Viaarxiv icon

Ministral 3

Add code
Jan 13, 2026
Viaarxiv icon

Schrodinger Audio-Visual Editor: Object-Level Audiovisual Removal

Add code
Dec 14, 2025
Viaarxiv icon

Voxtral

Add code
Jul 17, 2025
Viaarxiv icon

USAD: Universal Speech and Audio Representation via Distillation

Add code
Jun 23, 2025
Figure 1 for USAD: Universal Speech and Audio Representation via Distillation
Figure 2 for USAD: Universal Speech and Audio Representation via Distillation
Figure 3 for USAD: Universal Speech and Audio Representation via Distillation
Figure 4 for USAD: Universal Speech and Audio Representation via Distillation
Viaarxiv icon

Magistral

Add code
Jun 12, 2025
Figure 1 for Magistral
Figure 2 for Magistral
Figure 3 for Magistral
Figure 4 for Magistral
Viaarxiv icon

Full-Duplex-Bench: A Benchmark to Evaluate Full-duplex Spoken Dialogue Models on Turn-taking Capabilities

Add code
Mar 06, 2025
Figure 1 for Full-Duplex-Bench: A Benchmark to Evaluate Full-duplex Spoken Dialogue Models on Turn-taking Capabilities
Figure 2 for Full-Duplex-Bench: A Benchmark to Evaluate Full-duplex Spoken Dialogue Models on Turn-taking Capabilities
Figure 3 for Full-Duplex-Bench: A Benchmark to Evaluate Full-duplex Spoken Dialogue Models on Turn-taking Capabilities
Figure 4 for Full-Duplex-Bench: A Benchmark to Evaluate Full-duplex Spoken Dialogue Models on Turn-taking Capabilities
Viaarxiv icon