Picture for Themos Stafylakis

Themos Stafylakis

LabelBuddy: An Open Source Music and Audio Language Annotation Tagging Tool Using AI Assistance

Add code
Mar 04, 2026
Viaarxiv icon

Alpha Divergence Losses for Biometric Verification

Add code
Nov 19, 2025
Viaarxiv icon

Analysis of ABC Frontend Audio Systems for the NIST-SRE24

Add code
May 21, 2025
Viaarxiv icon

State-of-the-art Embeddings with Video-free Segmentation of the Source VoxCeleb Data

Add code
Oct 03, 2024
Figure 1 for State-of-the-art Embeddings with Video-free Segmentation of the Source VoxCeleb Data
Figure 2 for State-of-the-art Embeddings with Video-free Segmentation of the Source VoxCeleb Data
Figure 3 for State-of-the-art Embeddings with Video-free Segmentation of the Source VoxCeleb Data
Figure 4 for State-of-the-art Embeddings with Video-free Segmentation of the Source VoxCeleb Data
Viaarxiv icon

BUT Systems and Analyses for the ASVspoof 5 Challenge

Add code
Aug 20, 2024
Figure 1 for BUT Systems and Analyses for the ASVspoof 5 Challenge
Figure 2 for BUT Systems and Analyses for the ASVspoof 5 Challenge
Figure 3 for BUT Systems and Analyses for the ASVspoof 5 Challenge
Figure 4 for BUT Systems and Analyses for the ASVspoof 5 Challenge
Viaarxiv icon

Challenging margin-based speaker embedding extractors by using the variational information bottleneck

Add code
Jun 18, 2024
Figure 1 for Challenging margin-based speaker embedding extractors by using the variational information bottleneck
Figure 2 for Challenging margin-based speaker embedding extractors by using the variational information bottleneck
Figure 3 for Challenging margin-based speaker embedding extractors by using the variational information bottleneck
Viaarxiv icon

Comparing Data Augmentation Methods for End-to-End Task-Oriented Dialog Systems

Add code
Jun 10, 2024
Figure 1 for Comparing Data Augmentation Methods for End-to-End Task-Oriented Dialog Systems
Figure 2 for Comparing Data Augmentation Methods for End-to-End Task-Oriented Dialog Systems
Figure 3 for Comparing Data Augmentation Methods for End-to-End Task-Oriented Dialog Systems
Figure 4 for Comparing Data Augmentation Methods for End-to-End Task-Oriented Dialog Systems
Viaarxiv icon

Do End-to-End Neural Diarization Attractors Need to Encode Speaker Characteristic Information?

Add code
Feb 29, 2024
Figure 1 for Do End-to-End Neural Diarization Attractors Need to Encode Speaker Characteristic Information?
Figure 2 for Do End-to-End Neural Diarization Attractors Need to Encode Speaker Characteristic Information?
Figure 3 for Do End-to-End Neural Diarization Attractors Need to Encode Speaker Characteristic Information?
Figure 4 for Do End-to-End Neural Diarization Attractors Need to Encode Speaker Characteristic Information?
Viaarxiv icon

DiaPer: End-to-End Neural Diarization with Perceiver-Based Attractors

Add code
Dec 22, 2023
Figure 1 for DiaPer: End-to-End Neural Diarization with Perceiver-Based Attractors
Figure 2 for DiaPer: End-to-End Neural Diarization with Perceiver-Based Attractors
Figure 3 for DiaPer: End-to-End Neural Diarization with Perceiver-Based Attractors
Figure 4 for DiaPer: End-to-End Neural Diarization with Perceiver-Based Attractors
Viaarxiv icon

A Simple Baseline for Knowledge-Based Visual Question Answering

Add code
Oct 24, 2023
Figure 1 for A Simple Baseline for Knowledge-Based Visual Question Answering
Figure 2 for A Simple Baseline for Knowledge-Based Visual Question Answering
Figure 3 for A Simple Baseline for Knowledge-Based Visual Question Answering
Figure 4 for A Simple Baseline for Knowledge-Based Visual Question Answering
Viaarxiv icon