Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:NatiQ: An End-to-end Text-to-Speech System for Arabic

Jun 15, 2022

Ahmed Abdelali, Nadir Durrani, Cenk Demiroglu, Fahim Dalvi, Hamdy Mubarak, Kareem Darwish

Figure 1 for NatiQ: An End-to-end Text-to-Speech System for Arabic

Figure 2 for NatiQ: An End-to-end Text-to-Speech System for Arabic

Figure 3 for NatiQ: An End-to-end Text-to-Speech System for Arabic

Figure 4 for NatiQ: An End-to-end Text-to-Speech System for Arabic

Share this with someone who'll enjoy it:

Abstract:NatiQ is end-to-end text-to-speech system for Arabic. Our speech synthesizer uses an encoder-decoder architecture with attention. We used both tacotron-based models (tacotron-1 and tacotron-2) and the faster transformer model for generating mel-spectrograms from characters. We concatenated Tacotron1 with the WaveRNN vocoder, Tacotron2 with the WaveGlow vocoder and ESPnet transformer with the parallel wavegan vocoder to synthesize waveforms from the spectrograms. We used in-house speech data for two voices: 1) neutral male "Hamza"- narrating general content and news, and 2) expressive female "Amina"- narrating children story books to train our models. Our best systems achieve an average Mean Opinion Score (MOS) of 4.21 and 4.40 for Amina and Hamza respectively. The objective evaluation of the systems using word and character error rate (WER and CER) as well as the response time measured by real-time factor favored the end-to-end architecture ESPnet. NatiQ demo is available on-line at https://tts.qcri.org

View paper on

Share this with someone who'll enjoy it:

Title:NatiQ: An End-to-end Text-to-Speech System for Arabic

Paper and Code