Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Quaternion Neural Networks for Multi-channel Distant Speech Recognition

May 19, 2020

Xinchi Qiu, Titouan Parcollet, Mirco Ravanelli, Nicholas Lane, Mohamed Morchid

Figure 1 for Quaternion Neural Networks for Multi-channel Distant Speech Recognition

Figure 2 for Quaternion Neural Networks for Multi-channel Distant Speech Recognition

Figure 3 for Quaternion Neural Networks for Multi-channel Distant Speech Recognition

Share this with someone who'll enjoy it:

Abstract:Despite the significant progress in automatic speech recognition (ASR), distant ASR remains challenging due to noise and reverberation. A common approach to mitigate this issue consists of equipping the recording devices with multiple microphones that capture the acoustic scene from different perspectives. These multi-channel audio recordings contain specific internal relations between each signal. In this paper, we propose to capture these inter- and intra- structural dependencies with quaternion neural networks, which can jointly process multiple signals as whole quaternion entities. The quaternion algebra replaces the standard dot product with the Hamilton one, thus offering a simple and elegant way to model dependencies between elements. The quaternion layers are then coupled with a recurrent neural network, which can learn long-term dependencies in the time domain. We show that a quaternion long-short term memory neural network (QLSTM), trained on the concatenated multi-channel speech signals, outperforms equivalent real-valued LSTM on two different tasks of multi-channel distant speech recognition.

* 4 pages

View paper on

Share this with someone who'll enjoy it:

Title:Quaternion Neural Networks for Multi-channel Distant Speech Recognition

Paper and Code