Bang Zeng

LauraTSE: Target Speaker Extraction using Auto-Regressive Decoder-Only Language Models

Apr 10, 2025

Universal Speaker Embedding Free Target Speaker Extraction and Personal Voice Activity Detection

Jan 07, 2025

TSELM: Target Speaker Extraction using Discrete Tokens and Language Models

Sep 12, 2024

USEF-TSE: Universal Speaker Embedding Free Target Speaker Extraction

Sep 04, 2024

Simultaneous Speech Extraction for Multiple Target Speakers under the Meeting Scenarios(V1)

Jun 17, 2022