Picture for Kyle Kastner

Kyle Kastner

NEUROSPIN, PARIETAL

Adaptive Accompaniment with ReaLchords

Add code
Jun 17, 2025
Viaarxiv icon

Zero-shot Cross-lingual Voice Transfer for TTS

Add code
Sep 20, 2024
Figure 1 for Zero-shot Cross-lingual Voice Transfer for TTS
Figure 2 for Zero-shot Cross-lingual Voice Transfer for TTS
Viaarxiv icon

Adversarial training of Keyword Spotting to Minimize TTS Data Overfitting

Add code
Aug 20, 2024
Figure 1 for Adversarial training of Keyword Spotting to Minimize TTS Data Overfitting
Figure 2 for Adversarial training of Keyword Spotting to Minimize TTS Data Overfitting
Figure 3 for Adversarial training of Keyword Spotting to Minimize TTS Data Overfitting
Figure 4 for Adversarial training of Keyword Spotting to Minimize TTS Data Overfitting
Viaarxiv icon

Utilizing TTS Synthesized Data for Efficient Development of Keyword Spotting Model

Add code
Jul 26, 2024
Figure 1 for Utilizing TTS Synthesized Data for Efficient Development of Keyword Spotting Model
Figure 2 for Utilizing TTS Synthesized Data for Efficient Development of Keyword Spotting Model
Figure 3 for Utilizing TTS Synthesized Data for Efficient Development of Keyword Spotting Model
Figure 4 for Utilizing TTS Synthesized Data for Efficient Development of Keyword Spotting Model
Viaarxiv icon

Extending Multilingual Speech Synthesis to 100+ Languages without Transcribed Data

Add code
Feb 29, 2024
Figure 1 for Extending Multilingual Speech Synthesis to 100+ Languages without Transcribed Data
Figure 2 for Extending Multilingual Speech Synthesis to 100+ Languages without Transcribed Data
Figure 3 for Extending Multilingual Speech Synthesis to 100+ Languages without Transcribed Data
Figure 4 for Extending Multilingual Speech Synthesis to 100+ Languages without Transcribed Data
Viaarxiv icon

High-precision Voice Search Query Correction via Retrievable Speech-text Embedings

Add code
Jan 08, 2024
Viaarxiv icon

Understanding Shared Speech-Text Representations

Add code
Apr 27, 2023
Viaarxiv icon

R-MelNet: Reduced Mel-Spectral Modeling for Neural TTS

Add code
Jun 30, 2022
Figure 1 for R-MelNet: Reduced Mel-Spectral Modeling for Neural TTS
Figure 2 for R-MelNet: Reduced Mel-Spectral Modeling for Neural TTS
Figure 3 for R-MelNet: Reduced Mel-Spectral Modeling for Neural TTS
Figure 4 for R-MelNet: Reduced Mel-Spectral Modeling for Neural TTS
Viaarxiv icon

MIDI-DDSP: Detailed Control of Musical Performance via Hierarchical Modeling

Add code
Dec 17, 2021
Figure 1 for MIDI-DDSP: Detailed Control of Musical Performance via Hierarchical Modeling
Figure 2 for MIDI-DDSP: Detailed Control of Musical Performance via Hierarchical Modeling
Figure 3 for MIDI-DDSP: Detailed Control of Musical Performance via Hierarchical Modeling
Figure 4 for MIDI-DDSP: Detailed Control of Musical Performance via Hierarchical Modeling
Viaarxiv icon

Planning in Dynamic Environments with Conditional Autoregressive Models

Add code
Nov 25, 2018
Figure 1 for Planning in Dynamic Environments with Conditional Autoregressive Models
Figure 2 for Planning in Dynamic Environments with Conditional Autoregressive Models
Viaarxiv icon