Prem Seetharaman

PromptSep: Generative Audio Separation via Multimodal Prompting

Nov 06, 2025
Via arXiv

The Rhythm In Anything: Audio-Prompted Drums Generation with Masked Language Modeling

Sep 19, 2025
Via arXiv

FLAM: Frame-Wise Language-Audio Modeling

May 08, 2025
Via arXiv

SILA: Signal-to-Language Augmentation for Enhanced Control in Text-to-Audio Generation

Dec 13, 2024
Via arXiv

Sketch2Sound: Controllable Audio Generation via Time-Varying Signals and Sonic Imitations

Dec 11, 2024
Via arXiv

Video-Guided Foley Sound Generation with Multimodal Controls

Nov 26, 2024
Via arXiv

Code Drift: Towards Idempotent Neural Audio Codecs

Oct 14, 2024
Via arXiv

VampNet: Music Generation via Masked Acoustic Token Modeling

Jul 12, 2023
Via arXiv

High-Fidelity Audio Compression with Improved RVQGAN

Jun 11, 2023
Via arXiv

Music Separation Enhancement with Generative Modeling

Aug 26, 2022
Via arXiv