Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Emad M. Grais

Analysing Wideband Absorbance Immittance in Normal and Ears with Otitis Media with Effusion Using Machine Learning

Mar 04, 2021

Emad M. Grais, Xiaoya Wang, Jie Wang, Fei Zhao, Wen Jiang, Yuexin Cai, Lifang Zhang, Qingwen Lin, Haidi Yang

Figure 1 for Analysing Wideband Absorbance Immittance in Normal and Ears with Otitis Media with Effusion Using Machine Learning

Figure 2 for Analysing Wideband Absorbance Immittance in Normal and Ears with Otitis Media with Effusion Using Machine Learning

Figure 3 for Analysing Wideband Absorbance Immittance in Normal and Ears with Otitis Media with Effusion Using Machine Learning

Figure 4 for Analysing Wideband Absorbance Immittance in Normal and Ears with Otitis Media with Effusion Using Machine Learning

Abstract:Wideband Absorbance Immittance (WAI) has been available for more than a decade, however its clinical use still faces the challenges of limited understanding and poor interpretation of WAI results. This study aimed to develop Machine Learning (ML) tools to identify the WAI absorbance characteristics across different frequency-pressure regions in the normal middle ear and ears with otitis media with effusion (OME) to enable diagnosis of middle ear conditions automatically. Data analysis including pre-processing of the WAI data, statistical analysis and classification model development, together with key regions extraction from the 2D frequency-pressure WAI images are conducted in this study. Our experimental results show that ML tools appear to hold great potential for the automated diagnosis of middle ear diseases from WAI data. The identified key regions in the WAI provide guidance to practitioners to better understand and interpret WAI data and offer the prospect of quick and accurate diagnostic decisions.

Via

Access Paper or Ask Questions

Multi-Band Multi-Resolution Fully Convolutional Neural Networks for Singing Voice Separation

Oct 21, 2019

Emad M. Grais, Fei Zhao, Mark D. Plumbley

Figure 1 for Multi-Band Multi-Resolution Fully Convolutional Neural Networks for Singing Voice Separation

Figure 2 for Multi-Band Multi-Resolution Fully Convolutional Neural Networks for Singing Voice Separation

Figure 3 for Multi-Band Multi-Resolution Fully Convolutional Neural Networks for Singing Voice Separation

Figure 4 for Multi-Band Multi-Resolution Fully Convolutional Neural Networks for Singing Voice Separation

Abstract:Deep neural networks with convolutional layers usually process the entire spectrogram of an audio signal with the same time-frequency resolutions, number of filters, and dimensionality reduction scale. According to the constant-Q transform, good features can be extracted from audio signals if the low frequency bands are processed with high frequency resolution filters and the high frequency bands with high time resolution filters. In the spectrogram of a mixture of singing voices and music signals, there is usually more information about the voice in the low frequency bands than the high frequency bands. These raise the need for processing each part of the spectrogram differently. In this paper, we propose a multi-band multi-resolution fully convolutional neural network (MBR-FCN) for singing voice separation. The MBR-FCN processes the frequency bands that have more information about the target signals with more filters and smaller dimentionality reduction scale than the bands with less information. Furthermore, the MBR-FCN processes the low frequency bands with high frequency resolution filters and the high frequency bands with high time resolution filters. Our experimental results show that the proposed MBR-FCN with very few parameters achieves better singing voice separation performance than other deep neural networks.

Via

Access Paper or Ask Questions

Referenceless Performance Evaluation of Audio Source Separation using Deep Neural Networks

Nov 01, 2018

Emad M. Grais, Hagen Wierstorf, Dominic Ward, Russell Mason, Mark D. Plumbley

Figure 1 for Referenceless Performance Evaluation of Audio Source Separation using Deep Neural Networks

Figure 2 for Referenceless Performance Evaluation of Audio Source Separation using Deep Neural Networks

Figure 3 for Referenceless Performance Evaluation of Audio Source Separation using Deep Neural Networks

Abstract:Current performance evaluation for audio source separation depends on comparing the processed or separated signals with reference signals. Therefore, common performance evaluation toolkits are not applicable to real-world situations where the ground truth audio is unavailable. In this paper, we propose a performance evaluation technique that does not require reference signals in order to assess separation quality. The proposed technique uses a deep neural network (DNN) to map the processed audio into its quality score. Our experiment results show that the DNN is capable of predicting the sources-to-artifacts ratio from the blind source separation evaluation toolkit without the need for reference signals.

Via

Access Paper or Ask Questions

Raw Multi-Channel Audio Source Separation using Multi-Resolution Convolutional Auto-Encoders

Mar 02, 2018

Emad M. Grais, Dominic Ward, Mark D. Plumbley

Figure 1 for Raw Multi-Channel Audio Source Separation using Multi-Resolution Convolutional Auto-Encoders

Figure 2 for Raw Multi-Channel Audio Source Separation using Multi-Resolution Convolutional Auto-Encoders

Figure 3 for Raw Multi-Channel Audio Source Separation using Multi-Resolution Convolutional Auto-Encoders

Figure 4 for Raw Multi-Channel Audio Source Separation using Multi-Resolution Convolutional Auto-Encoders

Abstract:Supervised multi-channel audio source separation requires extracting useful spectral, temporal, and spatial features from the mixed signals. The success of many existing systems is therefore largely dependent on the choice of features used for training. In this work, we introduce a novel multi-channel, multi-resolution convolutional auto-encoder neural network that works on raw time-domain signals to determine appropriate multi-resolution features for separating the singing-voice from stereo music. Our experimental results show that the proposed method can achieve multi-channel audio source separation without the need for hand-crafted features or any pre- or post-processing.

Via

Access Paper or Ask Questions

Multi-Resolution Fully Convolutional Neural Networks for Monaural Audio Source Separation

Oct 28, 2017

Emad M. Grais, Hagen Wierstorf, Dominic Ward, Mark D. Plumbley

Figure 1 for Multi-Resolution Fully Convolutional Neural Networks for Monaural Audio Source Separation

Figure 2 for Multi-Resolution Fully Convolutional Neural Networks for Monaural Audio Source Separation

Figure 3 for Multi-Resolution Fully Convolutional Neural Networks for Monaural Audio Source Separation

Figure 4 for Multi-Resolution Fully Convolutional Neural Networks for Monaural Audio Source Separation

Abstract:In deep neural networks with convolutional layers, each layer typically has fixed-size/single-resolution receptive field (RF). Convolutional layers with a large RF capture global information from the input features, while layers with small RF size capture local details with high resolution from the input features. In this work, we introduce novel deep multi-resolution fully convolutional neural networks (MR-FCNN), where each layer has different RF sizes to extract multi-resolution features that capture the global and local details information from its input features. The proposed MR-FCNN is applied to separate a target audio source from a mixture of many audio sources. Experimental results show that using MR-FCNN improves the performance compared to feedforward deep neural networks (DNNs) and single resolution deep fully convolutional neural networks (FCNNs) on the audio source separation problem.

* arXiv admin note: text overlap with arXiv:1703.08019

Via

Access Paper or Ask Questions

Deep neural networks for single channel source separation

Nov 12, 2013

Emad M. Grais, Mehmet Umut Sen, Hakan Erdogan

Figure 1 for Deep neural networks for single channel source separation

Figure 2 for Deep neural networks for single channel source separation

Figure 3 for Deep neural networks for single channel source separation

Figure 4 for Deep neural networks for single channel source separation

Abstract:In this paper, a novel approach for single channel source separation (SCSS) using a deep neural network (DNN) architecture is introduced. Unlike previous studies in which DNN and other classifiers were used for classifying time-frequency bins to obtain hard masks for each source, we use the DNN to classify estimated source spectra to check for their validity during separation. In the training stage, the training data for the source signals are used to train a DNN. In the separation stage, the trained DNN is utilized to aid in estimation of each source in the mixed signal. Single channel source separation problem is formulated as an energy minimization problem where each source spectra estimate is encouraged to fit the trained DNN model and the mixed signal spectrum is encouraged to be written as a weighted sum of the estimated source spectra. The proposed approach works regardless of the energy scale differences between the source signals in the training and separation stages. Nonnegative matrix factorization (NMF) is used to initialize the DNN estimate for each source. The experimental results show that using DNN initialized by NMF for source separation improves the quality of the separated signal compared with using NMF for source separation.

* 5 pages, 2 figures, 2 tables, submitted to ICASSP2014

Via

Access Paper or Ask Questions

Source Separation using Regularized NMF with MMSE Estimates under GMM Priors with Online Learning for The Uncertainties

Feb 28, 2013

Emad M. Grais, Hakan Erdogan

Figure 1 for Source Separation using Regularized NMF with MMSE Estimates under GMM Priors with Online Learning for The Uncertainties

Figure 2 for Source Separation using Regularized NMF with MMSE Estimates under GMM Priors with Online Learning for The Uncertainties

Figure 3 for Source Separation using Regularized NMF with MMSE Estimates under GMM Priors with Online Learning for The Uncertainties

Figure 4 for Source Separation using Regularized NMF with MMSE Estimates under GMM Priors with Online Learning for The Uncertainties

Abstract:We propose a new method to enforce priors on the solution of the nonnegative matrix factorization (NMF). The proposed algorithm can be used for denoising or single-channel source separation (SCSS) applications. The NMF solution is guided to follow the Minimum Mean Square Error (MMSE) estimates under Gaussian mixture prior models (GMM) for the source signal. In SCSS applications, the spectra of the observed mixed signal are decomposed as a weighted linear combination of trained basis vectors for each source using NMF. In this work, the NMF decomposition weight matrices are treated as a distorted image by a distortion operator, which is learned directly from the observed signals. The MMSE estimate of the weights matrix under GMM prior and log-normal distribution for the distortion is then found to improve the NMF decomposition results. The MMSE estimate is embedded within the optimization objective to form a novel regularized NMF cost function. The corresponding update rules for the new objectives are derived in this paper. Experimental results show that, the proposed regularized NMF algorithm improves the source separation performance compared with using NMF without prior or with other prior models.

Via

Access Paper or Ask Questions