Picture for Laurie M. Heller

Laurie M. Heller

Department of Psychology, Carnegie Mellon University

Vision Language Models Are Few-Shot Audio Spectrogram Classifiers

Add code
Nov 18, 2024
Viaarxiv icon

Challenge on Sound Scene Synthesis: Evaluating Text-to-Audio Generation

Add code
Oct 23, 2024
Figure 1 for Challenge on Sound Scene Synthesis: Evaluating Text-to-Audio Generation
Figure 2 for Challenge on Sound Scene Synthesis: Evaluating Text-to-Audio Generation
Figure 3 for Challenge on Sound Scene Synthesis: Evaluating Text-to-Audio Generation
Figure 4 for Challenge on Sound Scene Synthesis: Evaluating Text-to-Audio Generation
Viaarxiv icon

Detection of Deepfake Environmental Audio

Add code
Mar 26, 2024
Figure 1 for Detection of Deepfake Environmental Audio
Figure 2 for Detection of Deepfake Environmental Audio
Figure 3 for Detection of Deepfake Environmental Audio
Viaarxiv icon

Correlation of Fréchet Audio Distance With Human Perception of Environmental Audio Is Embedding Dependant

Add code
Mar 26, 2024
Figure 1 for Correlation of Fréchet Audio Distance With Human Perception of Environmental Audio Is Embedding Dependant
Figure 2 for Correlation of Fréchet Audio Distance With Human Perception of Environmental Audio Is Embedding Dependant
Figure 3 for Correlation of Fréchet Audio Distance With Human Perception of Environmental Audio Is Embedding Dependant
Figure 4 for Correlation of Fréchet Audio Distance With Human Perception of Environmental Audio Is Embedding Dependant
Viaarxiv icon

Synergy between human and machine approaches to sound/scene recognition and processing: An overview of ICASSP special session

Add code
Feb 24, 2023
Viaarxiv icon

Identifying Actions for Sound Event Classification

Add code
Apr 26, 2021
Figure 1 for Identifying Actions for Sound Event Classification
Figure 2 for Identifying Actions for Sound Event Classification
Figure 3 for Identifying Actions for Sound Event Classification
Figure 4 for Identifying Actions for Sound Event Classification
Viaarxiv icon