Picture for Qunshan Gu

Qunshan Gu

VocalNet: Speech LLM with Multi-Token Prediction for Faster and High-Quality Generation

Add code
Apr 05, 2025
Viaarxiv icon