Picture for Zheng-Hua Tan

Zheng-Hua Tan

Aalborg University, Pioneer Centre for AI, Denmark

BiSSL: Bilevel Optimization for Self-Supervised Pre-Training and Fine-Tuning

Add code
Oct 03, 2024
Viaarxiv icon

Detecting and Defending Against Adversarial Attacks on Automatic Speech Recognition via Diffusion Models

Add code
Sep 12, 2024
Viaarxiv icon

Speaker and Style Disentanglement of Speech Based on Contrastive Predictive Coding Supported Factorized Variational Autoencoder

Add code
Sep 05, 2024
Viaarxiv icon

Audio xLSTMs: Learning Self-Supervised Audio Representations with xLSTMs

Add code
Sep 02, 2024
Viaarxiv icon

The Effect of Training Dataset Size on Discriminative and Diffusion-Based Speech Enhancement Systems

Add code
Jun 10, 2024
Viaarxiv icon

Zero-Shot Audio Captioning Using Soft and Hard Prompts

Add code
Jun 10, 2024
Viaarxiv icon

Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations

Add code
Jun 04, 2024
Viaarxiv icon

Noise-Robust Keyword Spotting through Self-supervised Pretraining

Add code
Mar 27, 2024
Viaarxiv icon

Neural Networks Hear You Loud And Clear: Hearing Loss Compensation Using Deep Neural Networks

Add code
Mar 15, 2024
Viaarxiv icon

How to train your ears: Auditory-model emulation for large-dynamic-range inputs and mild-to-severe hearing losses

Add code
Mar 15, 2024
Viaarxiv icon