Title: Zambezi Voice: A Multilingual Speech Corpus for Zambian Languages

Jun 13, 2023

Claytone Sikasote, Kalinda Siaminwe, Stanly Mwape, Bangiwe Zulu, Mofya Phiri, Martin Phiri, David Zulu, Mayumbo Nyirenda, Antonios Anastasopoulos

Abstract: This work introduces Zambezi Voice, an open-source multilingual speech resource for Zambian languages. It contains two collections of datasets: unlabelled audio recordings of radio news and talk show programs (160 hours), and labelled data (over 80 hours) consisting of read speech recorded from text sourced from publicly available literature. The dataset was created for speech recognition but can be extended to multilingual speech processing research using both supervised and unsupervised learning approaches. To our knowledge, this is the first multilingual speech dataset created for Zambian languages. To build our baseline end-to-end (E2E) speech recognition models, we exploit pretraining and cross-lingual transfer learning by fine-tuning the Wav2Vec2.0 large-scale multilingual pre-trained model. The dataset is released publicly under a Creative Commons BY-NC-ND 4.0 license and can be accessed at https://github.com/unza-speech-lab/zambezi-voice.

* Accepted at INTERSPEECH 2023. This pre-print differs slightly from the accepted version: Figure 1 is not included in the INTERSPEECH 2023 version.
