Boqing Zhu

Unsupervised Voice-Face Representation Learning by Cross-Modal Prototype Contrast

May 02, 2022
Via arXiv

Audio Tagging by Cross Filtering Noisy Labels

Jul 16, 2020
Via arXiv

General audio tagging with ensembling convolutional neural network and statistical features

Oct 30, 2018
Via arXiv

Mixup-Based Acoustic Scene Classification Using Multi-Channel Convolutional Neural Network

May 18, 2018
Via arXiv