Picture for Antonio Miguel

Antonio Miguel

Audio-Visual Speaker Diarization: Current Databases, Approaches and Challenges

Add code
Sep 09, 2024
Viaarxiv icon

Defining and Measuring Disentanglement for non-Independent Factors of Variation

Add code
Aug 13, 2024
Viaarxiv icon

Predefined Prototypes for Intra-Class Separation and Disentanglement

Add code
Jun 23, 2024
Figure 1 for Predefined Prototypes for Intra-Class Separation and Disentanglement
Figure 2 for Predefined Prototypes for Intra-Class Separation and Disentanglement
Viaarxiv icon

Permutation Invariant Recurrent Neural Networks for Sound Source Tracking Applications

Add code
Jun 14, 2023
Figure 1 for Permutation Invariant Recurrent Neural Networks for Sound Source Tracking Applications
Figure 2 for Permutation Invariant Recurrent Neural Networks for Sound Source Tracking Applications
Figure 3 for Permutation Invariant Recurrent Neural Networks for Sound Source Tracking Applications
Figure 4 for Permutation Invariant Recurrent Neural Networks for Sound Source Tracking Applications
Viaarxiv icon

Direction of Arrival Estimation of Sound Sources Using Icosahedral CNNs

Add code
Mar 31, 2022
Figure 1 for Direction of Arrival Estimation of Sound Sources Using Icosahedral CNNs
Figure 2 for Direction of Arrival Estimation of Sound Sources Using Icosahedral CNNs
Figure 3 for Direction of Arrival Estimation of Sound Sources Using Icosahedral CNNs
Figure 4 for Direction of Arrival Estimation of Sound Sources Using Icosahedral CNNs
Viaarxiv icon

Class Token and Knowledge Distillation for Multi-head Self-Attention Speaker Verification Systems

Add code
Nov 06, 2021
Figure 1 for Class Token and Knowledge Distillation for Multi-head Self-Attention Speaker Verification Systems
Figure 2 for Class Token and Knowledge Distillation for Multi-head Self-Attention Speaker Verification Systems
Figure 3 for Class Token and Knowledge Distillation for Multi-head Self-Attention Speaker Verification Systems
Figure 4 for Class Token and Knowledge Distillation for Multi-head Self-Attention Speaker Verification Systems
Viaarxiv icon

Generalizing AUC Optimization to Multiclass Classification for Audio Segmentation With Limited Training Data

Add code
Oct 27, 2021
Figure 1 for Generalizing AUC Optimization to Multiclass Classification for Audio Segmentation With Limited Training Data
Figure 2 for Generalizing AUC Optimization to Multiclass Classification for Audio Segmentation With Limited Training Data
Figure 3 for Generalizing AUC Optimization to Multiclass Classification for Audio Segmentation With Limited Training Data
Viaarxiv icon

Robust Sound Source Tracking Using SRP-PHAT and 3D Convolutional Neural Networks

Add code
Jun 16, 2020
Figure 1 for Robust Sound Source Tracking Using SRP-PHAT and 3D Convolutional Neural Networks
Figure 2 for Robust Sound Source Tracking Using SRP-PHAT and 3D Convolutional Neural Networks
Figure 3 for Robust Sound Source Tracking Using SRP-PHAT and 3D Convolutional Neural Networks
Figure 4 for Robust Sound Source Tracking Using SRP-PHAT and 3D Convolutional Neural Networks
Viaarxiv icon

Optimization of the Area Under the ROC Curve using Neural Network Supervectors for Text-Dependent Speaker Verification

Add code
Jan 31, 2019
Figure 1 for Optimization of the Area Under the ROC Curve using Neural Network Supervectors for Text-Dependent Speaker Verification
Figure 2 for Optimization of the Area Under the ROC Curve using Neural Network Supervectors for Text-Dependent Speaker Verification
Figure 3 for Optimization of the Area Under the ROC Curve using Neural Network Supervectors for Text-Dependent Speaker Verification
Figure 4 for Optimization of the Area Under the ROC Curve using Neural Network Supervectors for Text-Dependent Speaker Verification
Viaarxiv icon

Disentangling in Variational Autoencoders with Natural Clustering

Add code
Jan 27, 2019
Figure 1 for Disentangling in Variational Autoencoders with Natural Clustering
Figure 2 for Disentangling in Variational Autoencoders with Natural Clustering
Figure 3 for Disentangling in Variational Autoencoders with Natural Clustering
Figure 4 for Disentangling in Variational Autoencoders with Natural Clustering
Viaarxiv icon