Picture for Bang Zeng

Bang Zeng

Universal Speaker Embedding Free Target Speaker Extraction and Personal Voice Activity Detection

Add code
Jan 07, 2025
Figure 1 for Universal Speaker Embedding Free Target Speaker Extraction and Personal Voice Activity Detection
Figure 2 for Universal Speaker Embedding Free Target Speaker Extraction and Personal Voice Activity Detection
Figure 3 for Universal Speaker Embedding Free Target Speaker Extraction and Personal Voice Activity Detection
Figure 4 for Universal Speaker Embedding Free Target Speaker Extraction and Personal Voice Activity Detection
Viaarxiv icon

TSELM: Target Speaker Extraction using Discrete Tokens and Language Models

Add code
Sep 12, 2024
Viaarxiv icon

USEF-TSE: Universal Speaker Embedding Free Target Speaker Extraction

Add code
Sep 04, 2024
Viaarxiv icon

Simultaneous Speech Extraction for Multiple Target Speakers under the Meeting Scenarios(V1)

Add code
Jun 17, 2022
Figure 1 for Simultaneous Speech Extraction for Multiple Target Speakers under the Meeting Scenarios(V1)
Figure 2 for Simultaneous Speech Extraction for Multiple Target Speakers under the Meeting Scenarios(V1)
Figure 3 for Simultaneous Speech Extraction for Multiple Target Speakers under the Meeting Scenarios(V1)
Figure 4 for Simultaneous Speech Extraction for Multiple Target Speakers under the Meeting Scenarios(V1)
Viaarxiv icon