Picture for Fadi Biadsy

Fadi Biadsy

Zero-shot Cross-lingual Voice Transfer for TTS

Add code
Sep 20, 2024
Figure 1 for Zero-shot Cross-lingual Voice Transfer for TTS
Figure 2 for Zero-shot Cross-lingual Voice Transfer for TTS
Viaarxiv icon

Hierarchical Recurrent Adapters for Efficient Multi-Task Adaptation of Large Speech Models

Add code
Mar 25, 2024
Figure 1 for Hierarchical Recurrent Adapters for Efficient Multi-Task Adaptation of Large Speech Models
Figure 2 for Hierarchical Recurrent Adapters for Efficient Multi-Task Adaptation of Large Speech Models
Figure 3 for Hierarchical Recurrent Adapters for Efficient Multi-Task Adaptation of Large Speech Models
Figure 4 for Hierarchical Recurrent Adapters for Efficient Multi-Task Adaptation of Large Speech Models
Viaarxiv icon

Streaming Parrotron for on-device speech-to-speech conversion

Add code
Oct 25, 2022
Viaarxiv icon

Non-Parallel Voice Conversion for ASR Augmentation

Add code
Sep 15, 2022
Figure 1 for Non-Parallel Voice Conversion for ASR Augmentation
Figure 2 for Non-Parallel Voice Conversion for ASR Augmentation
Figure 3 for Non-Parallel Voice Conversion for ASR Augmentation
Figure 4 for Non-Parallel Voice Conversion for ASR Augmentation
Viaarxiv icon

A Scalable Model Specialization Framework for Training and Inference using Submodels and its Application to Speech Model Personalization

Add code
Mar 23, 2022
Figure 1 for A Scalable Model Specialization Framework for Training and Inference using Submodels and its Application to Speech Model Personalization
Figure 2 for A Scalable Model Specialization Framework for Training and Inference using Submodels and its Application to Speech Model Personalization
Figure 3 for A Scalable Model Specialization Framework for Training and Inference using Submodels and its Application to Speech Model Personalization
Viaarxiv icon

Real time spectrogram inversion on mobile phone

Add code
Mar 10, 2022
Figure 1 for Real time spectrogram inversion on mobile phone
Figure 2 for Real time spectrogram inversion on mobile phone
Figure 3 for Real time spectrogram inversion on mobile phone
Figure 4 for Real time spectrogram inversion on mobile phone
Viaarxiv icon

Residual Adapters for Parameter-Efficient ASR Adaptation to Atypical and Accented Speech

Add code
Sep 14, 2021
Figure 1 for Residual Adapters for Parameter-Efficient ASR Adaptation to Atypical and Accented Speech
Figure 2 for Residual Adapters for Parameter-Efficient ASR Adaptation to Atypical and Accented Speech
Figure 3 for Residual Adapters for Parameter-Efficient ASR Adaptation to Atypical and Accented Speech
Figure 4 for Residual Adapters for Parameter-Efficient ASR Adaptation to Atypical and Accented Speech
Viaarxiv icon

Direct speech-to-speech translation with a sequence-to-sequence model

Add code
Apr 12, 2019
Figure 1 for Direct speech-to-speech translation with a sequence-to-sequence model
Figure 2 for Direct speech-to-speech translation with a sequence-to-sequence model
Figure 3 for Direct speech-to-speech translation with a sequence-to-sequence model
Figure 4 for Direct speech-to-speech translation with a sequence-to-sequence model
Viaarxiv icon