Picture for Byeong-Yeol Kim

Byeong-Yeol Kim

Bridging the Gap between Audio and Text using Parallel-attention for User-defined Keyword Spotting

Add code
Aug 07, 2024
Viaarxiv icon

Faces that Speak: Jointly Synthesising Talking Face and Speech from Text

Add code
May 16, 2024
Viaarxiv icon

Boosting Unknown-number Speaker Separation with Transformer Decoder-based Attractor

Add code
Jan 23, 2024
Viaarxiv icon

Neural Speech Enhancement with Very Low Algorithmic Latency and Complexity via Integrated Full- and Sub-Band Modeling

Add code
Apr 18, 2023
Viaarxiv icon

Joint unsupervised and supervised learning for context-aware language identification

Add code
Apr 14, 2023
Viaarxiv icon

That's What I Said: Fully-Controllable Talking Face Generation

Add code
Apr 06, 2023
Viaarxiv icon

CrossSpeech: Speaker-independent Acoustic Representation for Cross-lingual Speech Synthesis

Add code
Feb 28, 2023
Viaarxiv icon

TF-GridNet: Integrating Full- and Sub-Band Modeling for Speech Separation

Add code
Nov 22, 2022
Viaarxiv icon

Metric Learning for User-defined Keyword Spotting

Add code
Nov 01, 2022
Viaarxiv icon

TF-GridNet: Making Time-Frequency Domain Models Great Again for Monaural Speaker Separation

Add code
Sep 08, 2022
Figure 1 for TF-GridNet: Making Time-Frequency Domain Models Great Again for Monaural Speaker Separation
Figure 2 for TF-GridNet: Making Time-Frequency Domain Models Great Again for Monaural Speaker Separation
Figure 3 for TF-GridNet: Making Time-Frequency Domain Models Great Again for Monaural Speaker Separation
Figure 4 for TF-GridNet: Making Time-Frequency Domain Models Great Again for Monaural Speaker Separation
Viaarxiv icon