Tatiana Likhomanenko

Visatronic: A Multimodal Decoder-Only Model for Speech Synthesis

Nov 26, 2024

Speaker-IPL: Unsupervised Learning of Speaker Characteristics with i-Vector based Pseudo-Labels

Sep 16, 2024

Exploring Prediction Targets in Masked Pre-Training for Speech Foundation Models

Sep 16, 2024

Towards Automatic Assessment of Self-Supervised Speech Models using Rank

Sep 16, 2024

Theory, Analysis, and Best Practices for Sigmoid Self-Attention

Sep 06, 2024

Generating Gender Alternatives in Machine Translation

Jul 29, 2024

dMel: Speech Tokenization made Simple

Jul 22, 2024

Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition

May 24, 2024

Can you Remove the Downstream Model for Speaker Recognition with Self-Supervised Speech Features?

Feb 01, 2024

AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition

Sep 29, 2023