Picture for Xin Lei

Xin Lei

Query-by-Example Keyword Spotting Using Spectral-Temporal Graph Attentive Pooling and Multi-Task Learning

Add code
Aug 27, 2024
Viaarxiv icon

Disentangled Training with Adversarial Examples For Robust Small-footprint Keyword Spotting

Add code
Aug 23, 2024
Figure 1 for Disentangled Training with Adversarial Examples For Robust Small-footprint Keyword Spotting
Figure 2 for Disentangled Training with Adversarial Examples For Robust Small-footprint Keyword Spotting
Figure 3 for Disentangled Training with Adversarial Examples For Robust Small-footprint Keyword Spotting
Figure 4 for Disentangled Training with Adversarial Examples For Robust Small-footprint Keyword Spotting
Viaarxiv icon

LLaMA based Punctuation Restoration With Forward Pass Only Decoding

Add code
Aug 09, 2024
Viaarxiv icon

FADI-AEC: Fast Score Based Diffusion Model Guided by Far-end Signal for Acoustic Echo Cancellation

Add code
Jan 08, 2024
Viaarxiv icon

Directional Source Separation for Robust Speech Recognition on Smart Glasses

Add code
Sep 20, 2023
Viaarxiv icon

TODM: Train Once Deploy Many Efficient Supernet-Based RNN-T Compression For On-device ASR Models

Add code
Sep 05, 2023
Figure 1 for TODM: Train Once Deploy Many Efficient Supernet-Based RNN-T Compression For On-device ASR Models
Figure 2 for TODM: Train Once Deploy Many Efficient Supernet-Based RNN-T Compression For On-device ASR Models
Figure 3 for TODM: Train Once Deploy Many Efficient Supernet-Based RNN-T Compression For On-device ASR Models
Figure 4 for TODM: Train Once Deploy Many Efficient Supernet-Based RNN-T Compression For On-device ASR Models
Viaarxiv icon

LiCo-Net: Linearized Convolution Network for Hardware-efficient Keyword Spotting

Add code
Nov 09, 2022
Viaarxiv icon

SCA: Streaming Cross-attention Alignment for Echo Cancellation

Add code
Nov 01, 2022
Viaarxiv icon

U2++: Unified Two-pass Bidirectional End-to-end Model for Speech Recognition

Add code
Jul 07, 2021
Figure 1 for U2++: Unified Two-pass Bidirectional End-to-end Model for Speech Recognition
Figure 2 for U2++: Unified Two-pass Bidirectional End-to-end Model for Speech Recognition
Figure 3 for U2++: Unified Two-pass Bidirectional End-to-end Model for Speech Recognition
Figure 4 for U2++: Unified Two-pass Bidirectional End-to-end Model for Speech Recognition
Viaarxiv icon

WeNet: Production First and Production Ready End-to-End Speech Recognition Toolkit

Add code
Feb 02, 2021
Figure 1 for WeNet: Production First and Production Ready End-to-End Speech Recognition Toolkit
Figure 2 for WeNet: Production First and Production Ready End-to-End Speech Recognition Toolkit
Figure 3 for WeNet: Production First and Production Ready End-to-End Speech Recognition Toolkit
Figure 4 for WeNet: Production First and Production Ready End-to-End Speech Recognition Toolkit
Viaarxiv icon