Picture for Prem Seetharaman

Prem Seetharaman

SILA: Signal-to-Language Augmentation for Enhanced Control in Text-to-Audio Generation

Add code
Dec 13, 2024
Viaarxiv icon

Sketch2Sound: Controllable Audio Generation via Time-Varying Signals and Sonic Imitations

Add code
Dec 11, 2024
Viaarxiv icon

Video-Guided Foley Sound Generation with Multimodal Controls

Add code
Nov 26, 2024
Viaarxiv icon

Code Drift: Towards Idempotent Neural Audio Codecs

Add code
Oct 14, 2024
Viaarxiv icon

VampNet: Music Generation via Masked Acoustic Token Modeling

Add code
Jul 12, 2023
Viaarxiv icon

High-Fidelity Audio Compression with Improved RVQGAN

Add code
Jun 11, 2023
Viaarxiv icon

Music Separation Enhancement with Generative Modeling

Add code
Aug 26, 2022
Figure 1 for Music Separation Enhancement with Generative Modeling
Figure 2 for Music Separation Enhancement with Generative Modeling
Figure 3 for Music Separation Enhancement with Generative Modeling
Figure 4 for Music Separation Enhancement with Generative Modeling
Viaarxiv icon

How to Listen? Rethinking Visual Sound Localization

Add code
Apr 11, 2022
Figure 1 for How to Listen? Rethinking Visual Sound Localization
Figure 2 for How to Listen? Rethinking Visual Sound Localization
Figure 3 for How to Listen? Rethinking Visual Sound Localization
Figure 4 for How to Listen? Rethinking Visual Sound Localization
Viaarxiv icon

Unsupervised Source Separation By Steering Pretrained Music Models

Add code
Oct 25, 2021
Figure 1 for Unsupervised Source Separation By Steering Pretrained Music Models
Figure 2 for Unsupervised Source Separation By Steering Pretrained Music Models
Figure 3 for Unsupervised Source Separation By Steering Pretrained Music Models
Viaarxiv icon

Wav2CLIP: Learning Robust Audio Representations From CLIP

Add code
Oct 21, 2021
Figure 1 for Wav2CLIP: Learning Robust Audio Representations From CLIP
Figure 2 for Wav2CLIP: Learning Robust Audio Representations From CLIP
Figure 3 for Wav2CLIP: Learning Robust Audio Representations From CLIP
Figure 4 for Wav2CLIP: Learning Robust Audio Representations From CLIP
Viaarxiv icon