Picture for Tuomas Virtanen

Tuomas Virtanen

Tampere University

Text-based Audio Retrieval by Learning from Similarities between Audio Captions

Add code
Dec 02, 2024
Viaarxiv icon

A decade of DCASE: Achievements, practices, evaluations and future challenges

Add code
Oct 07, 2024
Viaarxiv icon

SynthSOD: Developing an Heterogeneous Dataset for Orchestra Music Source Separation

Add code
Sep 17, 2024
Figure 1 for SynthSOD: Developing an Heterogeneous Dataset for Orchestra Music Source Separation
Figure 2 for SynthSOD: Developing an Heterogeneous Dataset for Orchestra Music Source Separation
Figure 3 for SynthSOD: Developing an Heterogeneous Dataset for Orchestra Music Source Separation
Figure 4 for SynthSOD: Developing an Heterogeneous Dataset for Orchestra Music Source Separation
Viaarxiv icon

Multi-label Zero-Shot Audio Classification with Temporal Attention

Add code
Aug 31, 2024
Viaarxiv icon

Noise-to-mask Ratio Loss for Deep Neural Network based Audio Watermarking

Add code
Aug 28, 2024
Viaarxiv icon

Integrating Continuous and Binary Relevances in Audio-Text Relevance Learning

Add code
Aug 27, 2024
Figure 1 for Integrating Continuous and Binary Relevances in Audio-Text Relevance Learning
Figure 2 for Integrating Continuous and Binary Relevances in Audio-Text Relevance Learning
Figure 3 for Integrating Continuous and Binary Relevances in Audio-Text Relevance Learning
Figure 4 for Integrating Continuous and Binary Relevances in Audio-Text Relevance Learning
Viaarxiv icon

Computer Audition: From Task-Specific Machine Learning to Foundation Models

Add code
Jul 22, 2024
Viaarxiv icon

Reference Channel Selection by Multi-Channel Masking for End-to-End Multi-Channel Speech Enhancement

Add code
Jun 05, 2024
Figure 1 for Reference Channel Selection by Multi-Channel Masking for End-to-End Multi-Channel Speech Enhancement
Figure 2 for Reference Channel Selection by Multi-Channel Masking for End-to-End Multi-Channel Speech Enhancement
Figure 3 for Reference Channel Selection by Multi-Channel Masking for End-to-End Multi-Channel Speech Enhancement
Figure 4 for Reference Channel Selection by Multi-Channel Masking for End-to-End Multi-Channel Speech Enhancement
Viaarxiv icon

Speaker Distance Estimation in Enclosures from Single-Channel Audio

Add code
Mar 26, 2024
Viaarxiv icon

From Weak to Strong Sound Event Labels using Adaptive Change-Point Detection and Active Learning

Add code
Mar 13, 2024
Viaarxiv icon