Wim Boes

Impact of visual assistance for automated audio captioning

Nov 18, 2022

Multi-Source Transformer Architectures for Audiovisual Scene Classification

Oct 18, 2022

Optimizing Temporal Resolution Of Convolutional Recurrent Neural Networks For Sound Event Detection

Oct 18, 2022

Impact of temporal resolution on convolutional recurrent networks for audio tagging and sound event detection

Sep 27, 2022

Multi-encoder attention-based architectures for sound recognition with partial visual assistance

Sep 26, 2022

On the long-term learning ability of LSTM LMs

Jun 16, 2021

Audiovisual transfer learning for audio tagging and sound event detection

Jun 09, 2021

Audiovisual Transformer Architectures for Large-Scale Classification and Synchronization of Weakly Labeled Audio Events

Dec 02, 2019