Picture for Daniel Tompkins

Daniel Tompkins

Laugh Now Cry Later: Controlling Time-Varying Emotional States of Flow-Matching-Based Zero-Shot Text-to-Speech

Add code
Jul 17, 2024
Figure 1 for Laugh Now Cry Later: Controlling Time-Varying Emotional States of Flow-Matching-Based Zero-Shot Text-to-Speech
Figure 2 for Laugh Now Cry Later: Controlling Time-Varying Emotional States of Flow-Matching-Based Zero-Shot Text-to-Speech
Figure 3 for Laugh Now Cry Later: Controlling Time-Varying Emotional States of Flow-Matching-Based Zero-Shot Text-to-Speech
Figure 4 for Laugh Now Cry Later: Controlling Time-Varying Emotional States of Flow-Matching-Based Zero-Shot Text-to-Speech
Viaarxiv icon

BEATs: Audio Pre-Training with Acoustic Tokenizers

Add code
Dec 18, 2022
Viaarxiv icon

Maximizing Audio Event Detection Model Performance on Small Datasets Through Knowledge Transfer, Data Augmentation, And Pretraining: An Ablation Study

Add code
Feb 07, 2022
Figure 1 for Maximizing Audio Event Detection Model Performance on Small Datasets Through Knowledge Transfer, Data Augmentation, And Pretraining: An Ablation Study
Figure 2 for Maximizing Audio Event Detection Model Performance on Small Datasets Through Knowledge Transfer, Data Augmentation, And Pretraining: An Ablation Study
Figure 3 for Maximizing Audio Event Detection Model Performance on Small Datasets Through Knowledge Transfer, Data Augmentation, And Pretraining: An Ablation Study
Figure 4 for Maximizing Audio Event Detection Model Performance on Small Datasets Through Knowledge Transfer, Data Augmentation, And Pretraining: An Ablation Study
Viaarxiv icon

COVID-19 Detection Using Recorded Coughs in the 2021 DiCOVA Challenge

Add code
May 22, 2021
Figure 1 for COVID-19 Detection Using Recorded Coughs in the 2021 DiCOVA Challenge
Figure 2 for COVID-19 Detection Using Recorded Coughs in the 2021 DiCOVA Challenge
Figure 3 for COVID-19 Detection Using Recorded Coughs in the 2021 DiCOVA Challenge
Figure 4 for COVID-19 Detection Using Recorded Coughs in the 2021 DiCOVA Challenge
Viaarxiv icon

Multi-label Sound Event Retrieval Using a Deep Learning-based Siamese Structure with a Pairwise Presence Matrix

Add code
Feb 20, 2020
Figure 1 for Multi-label Sound Event Retrieval Using a Deep Learning-based Siamese Structure with a Pairwise Presence Matrix
Figure 2 for Multi-label Sound Event Retrieval Using a Deep Learning-based Siamese Structure with a Pairwise Presence Matrix
Figure 3 for Multi-label Sound Event Retrieval Using a Deep Learning-based Siamese Structure with a Pairwise Presence Matrix
Viaarxiv icon