Antonio Miguel

Audio-Visual Speaker Diarization: Current Databases, Approaches and Challenges

Sep 09, 2024

Defining and Measuring Disentanglement for non-Independent Factors of Variation

Aug 13, 2024

Predefined Prototypes for Intra-Class Separation and Disentanglement

Jun 23, 2024

Permutation Invariant Recurrent Neural Networks for Sound Source Tracking Applications

Jun 14, 2023

Direction of Arrival Estimation of Sound Sources Using Icosahedral CNNs

Mar 31, 2022

Class Token and Knowledge Distillation for Multi-head Self-Attention Speaker Verification Systems

Nov 06, 2021

Generalizing AUC Optimization to Multiclass Classification for Audio Segmentation With Limited Training Data

Oct 27, 2021

Robust Sound Source Tracking Using SRP-PHAT and 3D Convolutional Neural Networks

Jun 16, 2020

Optimization of the Area Under the ROC Curve using Neural Network Supervectors for Text-Dependent Speaker Verification

Jan 31, 2019

Disentangling in Variational Autoencoders with Natural Clustering

Jan 27, 2019