Picture for Anna Silnova

Anna Silnova

Joint Training of Speaker Embedding Extractor, Speech and Overlap Detection for Diarization

Add code
Nov 04, 2024
Figure 1 for Joint Training of Speaker Embedding Extractor, Speech and Overlap Detection for Diarization
Figure 2 for Joint Training of Speaker Embedding Extractor, Speech and Overlap Detection for Diarization
Figure 3 for Joint Training of Speaker Embedding Extractor, Speech and Overlap Detection for Diarization
Figure 4 for Joint Training of Speaker Embedding Extractor, Speech and Overlap Detection for Diarization
Viaarxiv icon

Leveraging Self-Supervised Learning for Speaker Diarization

Add code
Sep 14, 2024
Viaarxiv icon

BUT Systems and Analyses for the ASVspoof 5 Challenge

Add code
Aug 20, 2024
Viaarxiv icon

Challenging margin-based speaker embedding extractors by using the variational information bottleneck

Add code
Jun 18, 2024
Figure 1 for Challenging margin-based speaker embedding extractors by using the variational information bottleneck
Figure 2 for Challenging margin-based speaker embedding extractors by using the variational information bottleneck
Figure 3 for Challenging margin-based speaker embedding extractors by using the variational information bottleneck
Viaarxiv icon

Do End-to-End Neural Diarization Attractors Need to Encode Speaker Characteristic Information?

Add code
Feb 29, 2024
Viaarxiv icon

Discriminative Training of VBx Diarization

Add code
Oct 04, 2023
Viaarxiv icon

Multi-Stream Extension of Variational Bayesian HMM Clustering (MS-VBx) for Combined End-to-End and Vector Clustering-based Diarization

Add code
May 23, 2023
Figure 1 for Multi-Stream Extension of Variational Bayesian HMM Clustering (MS-VBx) for Combined End-to-End and Vector Clustering-based Diarization
Figure 2 for Multi-Stream Extension of Variational Bayesian HMM Clustering (MS-VBx) for Combined End-to-End and Vector Clustering-based Diarization
Figure 3 for Multi-Stream Extension of Variational Bayesian HMM Clustering (MS-VBx) for Combined End-to-End and Vector Clustering-based Diarization
Viaarxiv icon

Toroidal Probabilistic Spherical Discriminant Analysis

Add code
Oct 27, 2022
Viaarxiv icon

Training Speaker Embedding Extractors Using Multi-Speaker Audio with Unknown Speaker Boundaries

Add code
Mar 29, 2022
Figure 1 for Training Speaker Embedding Extractors Using Multi-Speaker Audio with Unknown Speaker Boundaries
Figure 2 for Training Speaker Embedding Extractors Using Multi-Speaker Audio with Unknown Speaker Boundaries
Figure 3 for Training Speaker Embedding Extractors Using Multi-Speaker Audio with Unknown Speaker Boundaries
Viaarxiv icon

Probabilistic Spherical Discriminant Analysis: An Alternative to PLDA for length-normalized embeddings

Add code
Mar 28, 2022
Figure 1 for Probabilistic Spherical Discriminant Analysis: An Alternative to PLDA for length-normalized embeddings
Viaarxiv icon