Yiteng Huang

M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses

Sep 17, 2024
Via arXiv

Effective Integration of KAN for Keyword Spotting

Sep 13, 2024
Via arXiv

Query-by-Example Keyword Spotting Using Spectral-Temporal Graph Attentive Pooling and Multi-Task Learning

Aug 27, 2024
Via arXiv

Disentangled Training with Adversarial Examples For Robust Small-footprint Keyword Spotting

Aug 23, 2024
Via arXiv

AGADIR: Towards Array-Geometry Agnostic Directional Speech Recognition

Jan 18, 2024
Via arXiv

FADI-AEC: Fast Score Based Diffusion Model Guided by Far-end Signal for Acoustic Echo Cancellation

Jan 08, 2024
Via arXiv

Directional Source Separation for Robust Speech Recognition on Smart Glasses

Sep 20, 2023
Via arXiv

Handling the Alignment for Wake Word Detection: A Comparison Between Alignment-Based, Alignment-Free and Hybrid Approaches

Feb 17, 2023
Via arXiv

LiCo-Net: Linearized Convolution Network for Hardware-efficient Keyword Spotting

Nov 09, 2022
Via arXiv

Streaming Noise Context Aware Enhancement For Automatic Speech Recognition in Multi-Talker Environments

May 17, 2022
Via arXiv