Picture for Donald S. Williamson

Donald S. Williamson

A Pre-training Framework that Encodes Noise Information for Speech Quality Assessment

Add code
Nov 07, 2024
Figure 1 for A Pre-training Framework that Encodes Noise Information for Speech Quality Assessment
Figure 2 for A Pre-training Framework that Encodes Noise Information for Speech Quality Assessment
Figure 3 for A Pre-training Framework that Encodes Noise Information for Speech Quality Assessment
Figure 4 for A Pre-training Framework that Encodes Noise Information for Speech Quality Assessment
Viaarxiv icon

A contrastive-learning approach for auditory attention detection

Add code
Oct 24, 2024
Viaarxiv icon

Using RLHF to align speech enhancement approaches to mean-opinion quality scores

Add code
Oct 17, 2024
Viaarxiv icon

SWIM: An Attention-Only Model for Speech Quality Assessment Under Subjective Variance

Add code
Oct 16, 2024
Viaarxiv icon

MMViT: Multiscale Multiview Vision Transformers

Add code
Apr 28, 2023
Viaarxiv icon

Attention-based Speech Enhancement Using Human Quality Perception Modelling

Add code
Mar 23, 2023
Viaarxiv icon

A Composite T60 Regression and Classification Approach for Speech Dereverberation

Add code
Feb 09, 2023
Viaarxiv icon

ConferencingSpeech 2022 Challenge: Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge for Online Conferencing Applications

Add code
Apr 01, 2022
Figure 1 for ConferencingSpeech 2022 Challenge: Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge for Online Conferencing Applications
Figure 2 for ConferencingSpeech 2022 Challenge: Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge for Online Conferencing Applications
Figure 3 for ConferencingSpeech 2022 Challenge: Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge for Online Conferencing Applications
Figure 4 for ConferencingSpeech 2022 Challenge: Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge for Online Conferencing Applications
Viaarxiv icon

Multi-channel Multi-frame ADL-MVDR for Target Speech Separation

Add code
Dec 24, 2020
Figure 1 for Multi-channel Multi-frame ADL-MVDR for Target Speech Separation
Figure 2 for Multi-channel Multi-frame ADL-MVDR for Target Speech Separation
Figure 3 for Multi-channel Multi-frame ADL-MVDR for Target Speech Separation
Figure 4 for Multi-channel Multi-frame ADL-MVDR for Target Speech Separation
Viaarxiv icon

A Pyramid Recurrent Network for Predicting Crowdsourced Speech-Quality Ratings of Real-World Signals

Add code
Jul 31, 2020
Figure 1 for A Pyramid Recurrent Network for Predicting Crowdsourced Speech-Quality Ratings of Real-World Signals
Figure 2 for A Pyramid Recurrent Network for Predicting Crowdsourced Speech-Quality Ratings of Real-World Signals
Figure 3 for A Pyramid Recurrent Network for Predicting Crowdsourced Speech-Quality Ratings of Real-World Signals
Figure 4 for A Pyramid Recurrent Network for Predicting Crowdsourced Speech-Quality Ratings of Real-World Signals
Viaarxiv icon