Title: Multi-Convformer: Extending Conformer with Multiple Convolution Kernels

Jul 04, 2024

Darshan Prabhu, Yifan Peng, Preethi Jyothi, Shinji Watanabe

Abstract: Convolutions have become essential in state-of-the-art end-to-end Automatic Speech Recognition (ASR) systems due to their efficient modeling of local context. Notably, their use in Conformers has led to superior performance compared to vanilla Transformer-based ASR systems. While components other than the convolution module in the Conformer have been reexamined, altering the convolution module itself has been far less explored. To this end, we introduce Multi-Convformer, which uses multiple convolution kernels within the convolution module of the Conformer in conjunction with gating. This improves the modeling of local dependencies at varying granularities. Our model rivals existing Conformer variants such as CgMLP and E-Branchformer in performance, while being more parameter efficient. We empirically compare our approach with Conformer and its variants across four different datasets and three different modeling paradigms and show up to 8% relative word error rate (WER) improvements.

* Accepted to INTERSPEECH 2024
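
The abstract describes running several convolution kernels of different sizes inside the Conformer's convolution module and combining them with gating. The sketch below is only an illustration of that general idea, not the authors' architecture: the abstract does not specify the gating form, so the per-branch sigmoid gate, the averaging of branches, and all names (`depthwise_conv1d`, `multi_kernel_gated_conv`, `gate_weights`) are assumptions made for this example. It uses plain Python lists so it runs without any dependencies.

```python
import math


def depthwise_conv1d(signal, kernel):
    """Same-padded 1D cross-correlation of one channel with one odd-length kernel.

    Positions outside the signal are treated as zeros.
    """
    pad = len(kernel) // 2
    out = []
    for t in range(len(signal)):
        acc = 0.0
        for j, w in enumerate(kernel):
            idx = t + j - pad
            if 0 <= idx < len(signal):
                acc += w * signal[idx]
        out.append(acc)
    return out


def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))


def multi_kernel_gated_conv(x, branch_kernels, gate_weights):
    """Combine several depthwise convolution branches with a simple gate.

    x:              list of channels, each a list of floats over time
    branch_kernels: one entry per branch; each entry holds one kernel per channel
    gate_weights:   one scalar per branch (hypothetical gate parameter)

    Assumed gating: each branch output is scaled elementwise by
    sigmoid(gate_weight * input), then the branches are averaged.
    """
    num_branches = len(branch_kernels)
    output = []
    for c, channel in enumerate(x):
        mixed = [0.0] * len(channel)
        for kernels, g in zip(branch_kernels, gate_weights):
            conv_out = depthwise_conv1d(channel, kernels[c])
            for t, value in enumerate(conv_out):
                mixed[t] += sigmoid(g * channel[t]) * value / num_branches
        output.append(mixed)
    return output


if __name__ == "__main__":
    # One channel, two branches: a short (size 3) and a longer (size 5) averaging kernel.
    features = [[1.0, 2.0, 3.0, 4.0, 5.0, 6.0]]
    short_branch = [[1 / 3] * 3]
    long_branch = [[1 / 5] * 5]
    print(multi_kernel_gated_conv(features, [short_branch, long_branch], [0.5, 0.5]))
```

Small kernels respond to fine-grained local patterns and larger ones to broader context, which is one way to read the abstract's "varying granularities"; a real model would learn the kernels and gate parameters rather than fix them by hand.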
