Picture for Paul Primus

Paul Primus

Effective Pre-Training of Audio Transformers for Sound Event Detection

Add code
Sep 14, 2024
Figure 1 for Effective Pre-Training of Audio Transformers for Sound Event Detection
Figure 2 for Effective Pre-Training of Audio Transformers for Sound Event Detection
Figure 3 for Effective Pre-Training of Audio Transformers for Sound Event Detection
Viaarxiv icon

Estimated Audio-Caption Correspondences Improve Language-Based Audio Retrieval

Add code
Aug 21, 2024
Viaarxiv icon

Improving Query-by-Vocal Imitation with Contrastive Learning and Audio Pretraining

Add code
Aug 21, 2024
Viaarxiv icon

Multi-Iteration Multi-Stage Fine-Tuning of Transformers for Sound Event Detection with Heterogeneous Datasets

Add code
Jul 17, 2024
Viaarxiv icon

Fusing Audio and Metadata Embeddings Improves Language-based Audio Retrieval

Add code
Jun 22, 2024
Viaarxiv icon

Data-Efficient Low-Complexity Acoustic Scene Classification in the DCASE 2024 Challenge

Add code
May 16, 2024
Viaarxiv icon

Advancing Natural-Language Based Audio Retrieval with PaSST and Large Audio-Caption Data Sets

Add code
Aug 08, 2023
Viaarxiv icon

Improving Natural-Language-based Audio Retrieval with Transfer Learning and Audio & Text Augmentations

Add code
Aug 24, 2022
Figure 1 for Improving Natural-Language-based Audio Retrieval with Transfer Learning and Audio & Text Augmentations
Figure 2 for Improving Natural-Language-based Audio Retrieval with Transfer Learning and Audio & Text Augmentations
Figure 3 for Improving Natural-Language-based Audio Retrieval with Transfer Learning and Audio & Text Augmentations
Figure 4 for Improving Natural-Language-based Audio Retrieval with Transfer Learning and Audio & Text Augmentations
Viaarxiv icon

Improved Zero-Shot Audio Tagging & Classification with Patchout Spectrogram Transformers

Add code
Aug 24, 2022
Figure 1 for Improved Zero-Shot Audio Tagging & Classification with Patchout Spectrogram Transformers
Figure 2 for Improved Zero-Shot Audio Tagging & Classification with Patchout Spectrogram Transformers
Figure 3 for Improved Zero-Shot Audio Tagging & Classification with Patchout Spectrogram Transformers
Figure 4 for Improved Zero-Shot Audio Tagging & Classification with Patchout Spectrogram Transformers
Viaarxiv icon

Anomalous Sound Detection as a Simple Binary Classification Problem with Careful Selection of Proxy Outlier Examples

Add code
Nov 05, 2020
Figure 1 for Anomalous Sound Detection as a Simple Binary Classification Problem with Careful Selection of Proxy Outlier Examples
Figure 2 for Anomalous Sound Detection as a Simple Binary Classification Problem with Careful Selection of Proxy Outlier Examples
Figure 3 for Anomalous Sound Detection as a Simple Binary Classification Problem with Careful Selection of Proxy Outlier Examples
Figure 4 for Anomalous Sound Detection as a Simple Binary Classification Problem with Careful Selection of Proxy Outlier Examples
Viaarxiv icon