Picture for Hu Hu

Hu Hu

Bayesian adaptive learning to latent variables via Variational Bayes and Maximum a Posteriori

Add code
Jan 24, 2024
Viaarxiv icon

TeachCLIP: Multi-Grained Teaching for Efficient Text-to-Video Retrieval

Add code
Aug 02, 2023
Viaarxiv icon

A study on joint modeling and data augmentation of multi-modalities for audio-visual scene classification

Add code
Mar 31, 2022
Figure 1 for A study on joint modeling and data augmentation of multi-modalities for audio-visual scene classification
Figure 2 for A study on joint modeling and data augmentation of multi-modalities for audio-visual scene classification
Figure 3 for A study on joint modeling and data augmentation of multi-modalities for audio-visual scene classification
Figure 4 for A study on joint modeling and data augmentation of multi-modalities for audio-visual scene classification
Viaarxiv icon

A Variational Bayesian Approach to Learning Latent Variables for Acoustic Knowledge Transfer

Add code
Oct 16, 2021
Figure 1 for A Variational Bayesian Approach to Learning Latent Variables for Acoustic Knowledge Transfer
Figure 2 for A Variational Bayesian Approach to Learning Latent Variables for Acoustic Knowledge Transfer
Figure 3 for A Variational Bayesian Approach to Learning Latent Variables for Acoustic Knowledge Transfer
Figure 4 for A Variational Bayesian Approach to Learning Latent Variables for Acoustic Knowledge Transfer
Viaarxiv icon

A Study of Low-Resource Speech Commands Recognition based on Adversarial Reprogramming

Add code
Oct 08, 2021
Figure 1 for A Study of Low-Resource Speech Commands Recognition based on Adversarial Reprogramming
Figure 2 for A Study of Low-Resource Speech Commands Recognition based on Adversarial Reprogramming
Figure 3 for A Study of Low-Resource Speech Commands Recognition based on Adversarial Reprogramming
Figure 4 for A Study of Low-Resource Speech Commands Recognition based on Adversarial Reprogramming
Viaarxiv icon

A Lottery Ticket Hypothesis Framework for Low-Complexity Device-Robust Neural Acoustic Scene Classification

Add code
Jul 03, 2021
Figure 1 for A Lottery Ticket Hypothesis Framework for Low-Complexity Device-Robust Neural Acoustic Scene Classification
Figure 2 for A Lottery Ticket Hypothesis Framework for Low-Complexity Device-Robust Neural Acoustic Scene Classification
Figure 3 for A Lottery Ticket Hypothesis Framework for Low-Complexity Device-Robust Neural Acoustic Scene Classification
Figure 4 for A Lottery Ticket Hypothesis Framework for Low-Complexity Device-Robust Neural Acoustic Scene Classification
Viaarxiv icon

REDAT: Accent-Invariant Representation for End-to-End ASR by Domain Adversarial Training with Relabeling

Add code
Dec 14, 2020
Figure 1 for REDAT: Accent-Invariant Representation for End-to-End ASR by Domain Adversarial Training with Relabeling
Figure 2 for REDAT: Accent-Invariant Representation for End-to-End ASR by Domain Adversarial Training with Relabeling
Viaarxiv icon

A Two-Stage Approach to Device-Robust Acoustic Scene Classification

Add code
Nov 03, 2020
Figure 1 for A Two-Stage Approach to Device-Robust Acoustic Scene Classification
Figure 2 for A Two-Stage Approach to Device-Robust Acoustic Scene Classification
Figure 3 for A Two-Stage Approach to Device-Robust Acoustic Scene Classification
Figure 4 for A Two-Stage Approach to Device-Robust Acoustic Scene Classification
Viaarxiv icon

Device-Robust Acoustic Scene Classification Based on Two-Stage Categorization and Data Augmentation

Add code
Aug 27, 2020
Figure 1 for Device-Robust Acoustic Scene Classification Based on Two-Stage Categorization and Data Augmentation
Figure 2 for Device-Robust Acoustic Scene Classification Based on Two-Stage Categorization and Data Augmentation
Figure 3 for Device-Robust Acoustic Scene Classification Based on Two-Stage Categorization and Data Augmentation
Viaarxiv icon

Exploring Deep Hybrid Tensor-to-Vector Network Architectures for Regression Based Speech Enhancement

Add code
Aug 03, 2020
Figure 1 for Exploring Deep Hybrid Tensor-to-Vector Network Architectures for Regression Based Speech Enhancement
Figure 2 for Exploring Deep Hybrid Tensor-to-Vector Network Architectures for Regression Based Speech Enhancement
Figure 3 for Exploring Deep Hybrid Tensor-to-Vector Network Architectures for Regression Based Speech Enhancement
Figure 4 for Exploring Deep Hybrid Tensor-to-Vector Network Architectures for Regression Based Speech Enhancement
Viaarxiv icon