Picture for Jesús Villalba

Jesús Villalba

Unraveling Adversarial Examples against Speaker Identification -- Techniques for Attack Detection and Victim Model Classification

Add code
Feb 29, 2024
Figure 1 for Unraveling Adversarial Examples against Speaker Identification -- Techniques for Attack Detection and Victim Model Classification
Figure 2 for Unraveling Adversarial Examples against Speaker Identification -- Techniques for Attack Detection and Victim Model Classification
Figure 3 for Unraveling Adversarial Examples against Speaker Identification -- Techniques for Attack Detection and Victim Model Classification
Figure 4 for Unraveling Adversarial Examples against Speaker Identification -- Techniques for Attack Detection and Victim Model Classification
Viaarxiv icon

Leveraging Pretrained Image-text Models for Improving Audio-Visual Learning

Add code
Sep 08, 2023
Figure 1 for Leveraging Pretrained Image-text Models for Improving Audio-Visual Learning
Figure 2 for Leveraging Pretrained Image-text Models for Improving Audio-Visual Learning
Figure 3 for Leveraging Pretrained Image-text Models for Improving Audio-Visual Learning
Figure 4 for Leveraging Pretrained Image-text Models for Improving Audio-Visual Learning
Viaarxiv icon

Regularizing Contrastive Predictive Coding for Speech Applications

Add code
Apr 26, 2023
Figure 1 for Regularizing Contrastive Predictive Coding for Speech Applications
Figure 2 for Regularizing Contrastive Predictive Coding for Speech Applications
Figure 3 for Regularizing Contrastive Predictive Coding for Speech Applications
Figure 4 for Regularizing Contrastive Predictive Coding for Speech Applications
Viaarxiv icon

Self-FiLM: Conditioning GANs with self-supervised representations for bandwidth extension based speaker recognition

Add code
Mar 07, 2023
Viaarxiv icon

Time-domain speech super-resolution with GAN based modeling for telephony speaker verification

Add code
Sep 04, 2022
Figure 1 for Time-domain speech super-resolution with GAN based modeling for telephony speaker verification
Figure 2 for Time-domain speech super-resolution with GAN based modeling for telephony speaker verification
Figure 3 for Time-domain speech super-resolution with GAN based modeling for telephony speaker verification
Figure 4 for Time-domain speech super-resolution with GAN based modeling for telephony speaker verification
Viaarxiv icon

Non-Contrastive Self-Supervised Learning of Utterance-Level Speech Representations

Add code
Aug 10, 2022
Figure 1 for Non-Contrastive Self-Supervised Learning of Utterance-Level Speech Representations
Figure 2 for Non-Contrastive Self-Supervised Learning of Utterance-Level Speech Representations
Figure 3 for Non-Contrastive Self-Supervised Learning of Utterance-Level Speech Representations
Figure 4 for Non-Contrastive Self-Supervised Learning of Utterance-Level Speech Representations
Viaarxiv icon

Joint domain adaptation and speech bandwidth extension using time-domain GANs for speaker verification

Add code
Mar 30, 2022
Figure 1 for Joint domain adaptation and speech bandwidth extension using time-domain GANs for speaker verification
Figure 2 for Joint domain adaptation and speech bandwidth extension using time-domain GANs for speaker verification
Figure 3 for Joint domain adaptation and speech bandwidth extension using time-domain GANs for speaker verification
Figure 4 for Joint domain adaptation and speech bandwidth extension using time-domain GANs for speaker verification
Viaarxiv icon

Unsupervised Speech Segmentation and Variable Rate Representation Learning using Segmental Contrastive Predictive Coding

Add code
Oct 08, 2021
Figure 1 for Unsupervised Speech Segmentation and Variable Rate Representation Learning using Segmental Contrastive Predictive Coding
Figure 2 for Unsupervised Speech Segmentation and Variable Rate Representation Learning using Segmental Contrastive Predictive Coding
Figure 3 for Unsupervised Speech Segmentation and Variable Rate Representation Learning using Segmental Contrastive Predictive Coding
Figure 4 for Unsupervised Speech Segmentation and Variable Rate Representation Learning using Segmental Contrastive Predictive Coding
Viaarxiv icon

Beyond Isolated Utterances: Conversational Emotion Recognition

Add code
Sep 13, 2021
Figure 1 for Beyond Isolated Utterances: Conversational Emotion Recognition
Figure 2 for Beyond Isolated Utterances: Conversational Emotion Recognition
Figure 3 for Beyond Isolated Utterances: Conversational Emotion Recognition
Figure 4 for Beyond Isolated Utterances: Conversational Emotion Recognition
Viaarxiv icon

Representation Learning to Classify and Detect Adversarial Attacks against Speaker and Speech Recognition Systems

Add code
Jul 09, 2021
Figure 1 for Representation Learning to Classify and Detect Adversarial Attacks against Speaker and Speech Recognition Systems
Figure 2 for Representation Learning to Classify and Detect Adversarial Attacks against Speaker and Speech Recognition Systems
Figure 3 for Representation Learning to Classify and Detect Adversarial Attacks against Speaker and Speech Recognition Systems
Figure 4 for Representation Learning to Classify and Detect Adversarial Attacks against Speaker and Speech Recognition Systems
Viaarxiv icon