Jaeyoung Roh

Triage knowledge distillation for speaker verification

Jan 21, 2026
Via arXiv

MATE: Matryoshka Audio-Text Embeddings for Open-Vocabulary Keyword Spotting

Jan 20, 2026
Via arXiv

DAME: Duration-Aware Matryoshka Embedding for Duration-Robust Speaker Verification

Jan 20, 2026
Via arXiv

Adversarial Deep Metric Learning for Cross-Modal Audio-Text Alignment in Open-Vocabulary Keyword Spotting

May 22, 2025
Via arXiv

CTC-aligned Audio-Text Embedding for Streaming Open-vocabulary Keyword Spotting

Jun 12, 2024
Via arXiv

Relational Proxy Loss for Audio-Text based Keyword Spotting

Jun 08, 2024
Via arXiv