Chih-Wei Wu

Facing the Music: Tackling Singing Voice Separation in Cinematic Audio Source Separation

Aug 07, 2024
Via arXiv

Remastering Divide and Remaster: A Cinematic Audio Source Separation Dataset with Multilingual Support

Jul 09, 2024
Via arXiv

ODAQ: Open Dataset of Audio Quality

Dec 30, 2023
Via arXiv

A Generalized Bandsplit Neural Network for Cinematic Audio Source Separation

Sep 07, 2023
Via arXiv

Looking Similar, Sounding Different: Leveraging Counterfactual Cross-Modal Pairs for Audiovisual Representation Learning

Apr 12, 2023
Via arXiv

AVASpeech-SMAD: A Strongly Labelled Speech and Music Activity Detection Dataset with Label Co-Occurrence

Nov 02, 2021
Via arXiv

Orientation-aware Vehicle Re-identification with Semantics-guided Part Attention Network

Aug 26, 2020
Via arXiv

Spatially and Temporally Efficient Non-local Attention Network for Video-based Person Re-Identification

Aug 05, 2019
Via arXiv

Learning to Fuse Music Genres with Generative Adversarial Dual Learning

Dec 05, 2017
Via arXiv