Picture for Kris Demuynck

Kris Demuynck

BEST-STD: Bidirectional Mamba-Enhanced Speech Tokenization for Spoken Term Detection

Add code
Nov 21, 2024
Figure 1 for BEST-STD: Bidirectional Mamba-Enhanced Speech Tokenization for Spoken Term Detection
Figure 2 for BEST-STD: Bidirectional Mamba-Enhanced Speech Tokenization for Spoken Term Detection
Figure 3 for BEST-STD: Bidirectional Mamba-Enhanced Speech Tokenization for Spoken Term Detection
Figure 4 for BEST-STD: Bidirectional Mamba-Enhanced Speech Tokenization for Spoken Term Detection
Viaarxiv icon

Speaker Embeddings With Weakly Supervised Voice Activity Detection For Efficient Speaker Diarization

Add code
May 15, 2024
Viaarxiv icon

ECAPA2: A Hybrid Neural Network Architecture and Training Strategy for Robust Speaker Embeddings

Add code
Jan 16, 2024
Viaarxiv icon

BioLORD-2023: Semantic Textual Representations Fusing LLM and Clinical Knowledge Graph Insights

Add code
Nov 27, 2023
Figure 1 for BioLORD-2023: Semantic Textual Representations Fusing LLM and Clinical Knowledge Graph Insights
Figure 2 for BioLORD-2023: Semantic Textual Representations Fusing LLM and Clinical Knowledge Graph Insights
Figure 3 for BioLORD-2023: Semantic Textual Representations Fusing LLM and Clinical Knowledge Graph Insights
Figure 4 for BioLORD-2023: Semantic Textual Representations Fusing LLM and Clinical Knowledge Graph Insights
Viaarxiv icon

Tik-to-Tok: Translating Language Models One Token at a Time: An Embedding Initialization Strategy for Efficient Language Adaptation

Add code
Oct 05, 2023
Figure 1 for Tik-to-Tok: Translating Language Models One Token at a Time: An Embedding Initialization Strategy for Efficient Language Adaptation
Figure 2 for Tik-to-Tok: Translating Language Models One Token at a Time: An Embedding Initialization Strategy for Efficient Language Adaptation
Figure 3 for Tik-to-Tok: Translating Language Models One Token at a Time: An Embedding Initialization Strategy for Efficient Language Adaptation
Figure 4 for Tik-to-Tok: Translating Language Models One Token at a Time: An Embedding Initialization Strategy for Efficient Language Adaptation
Viaarxiv icon

Behavioral Analysis of Pathological Speaker Embeddings of Patients During Oncological Treatment of Oral Cancer

Add code
Jul 10, 2023
Viaarxiv icon

Margin-Mixup: A Method for Robust Speaker Verification in Multi-Speaker Audio

Add code
Apr 07, 2023
Viaarxiv icon

Simultaneously Learning Robust Audio Embeddings and balanced Hash codes for Query-by-Example

Add code
Nov 20, 2022
Viaarxiv icon

Attention-Based Audio Embeddings for Query-by-Example

Add code
Oct 16, 2022
Figure 1 for Attention-Based Audio Embeddings for Query-by-Example
Figure 2 for Attention-Based Audio Embeddings for Query-by-Example
Figure 3 for Attention-Based Audio Embeddings for Query-by-Example
Figure 4 for Attention-Based Audio Embeddings for Query-by-Example
Viaarxiv icon

Transfer Learning for Robust Low-Resource Children's Speech ASR with Transformers and Source-Filter Warping

Add code
Jun 19, 2022
Figure 1 for Transfer Learning for Robust Low-Resource Children's Speech ASR with Transformers and Source-Filter Warping
Figure 2 for Transfer Learning for Robust Low-Resource Children's Speech ASR with Transformers and Source-Filter Warping
Figure 3 for Transfer Learning for Robust Low-Resource Children's Speech ASR with Transformers and Source-Filter Warping
Figure 4 for Transfer Learning for Robust Low-Resource Children's Speech ASR with Transformers and Source-Filter Warping
Viaarxiv icon