Picture for Zhifeng Kong

Zhifeng Kong

TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization

Add code
Dec 30, 2024
Viaarxiv icon

ETTA: Elucidating the Design Space of Text-to-Audio Models

Add code
Dec 26, 2024
Viaarxiv icon

Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data

Add code
Oct 02, 2024
Figure 1 for Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data
Figure 2 for Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data
Figure 3 for Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data
Figure 4 for Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data
Viaarxiv icon

A Geometry-Aware Algorithm to Learn Hierarchical Embeddings in Hyperbolic Space

Add code
Jul 23, 2024
Viaarxiv icon

Improving Text-To-Audio Models with Synthetic Captions

Add code
Jun 18, 2024
Figure 1 for Improving Text-To-Audio Models with Synthetic Captions
Figure 2 for Improving Text-To-Audio Models with Synthetic Captions
Figure 3 for Improving Text-To-Audio Models with Synthetic Captions
Figure 4 for Improving Text-To-Audio Models with Synthetic Captions
Viaarxiv icon

Audio Dialogues: Dialogues dataset for audio and music understanding

Add code
Apr 11, 2024
Figure 1 for Audio Dialogues: Dialogues dataset for audio and music understanding
Figure 2 for Audio Dialogues: Dialogues dataset for audio and music understanding
Figure 3 for Audio Dialogues: Dialogues dataset for audio and music understanding
Figure 4 for Audio Dialogues: Dialogues dataset for audio and music understanding
Viaarxiv icon

Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities

Add code
Feb 02, 2024
Viaarxiv icon

CleanUNet 2: A Hybrid Speech Denoising Model on Waveform and Spectrogram

Add code
Sep 12, 2023
Viaarxiv icon

Data Redaction from Conditional Generative Models

Add code
May 18, 2023
Viaarxiv icon

Can Membership Inferencing be Refuted?

Add code
Mar 08, 2023
Viaarxiv icon