Picture for Baoxiang Li

Baoxiang Li

SPEAR: A Unified SSL Framework for Learning Speech and Audio Representations

Add code
Oct 29, 2025
Figure 1 for SPEAR: A Unified SSL Framework for Learning Speech and Audio Representations
Figure 2 for SPEAR: A Unified SSL Framework for Learning Speech and Audio Representations
Figure 3 for SPEAR: A Unified SSL Framework for Learning Speech and Audio Representations
Figure 4 for SPEAR: A Unified SSL Framework for Learning Speech and Audio Representations
Viaarxiv icon

Dual-path Transformer Based Neural Beamformer for Target Speech Extraction

Add code
Sep 07, 2023
Figure 1 for Dual-path Transformer Based Neural Beamformer for Target Speech Extraction
Figure 2 for Dual-path Transformer Based Neural Beamformer for Target Speech Extraction
Figure 3 for Dual-path Transformer Based Neural Beamformer for Target Speech Extraction
Figure 4 for Dual-path Transformer Based Neural Beamformer for Target Speech Extraction
Viaarxiv icon

Say Goodbye to RNN-T Loss: A Novel CIF-based Transducer Architecture for Automatic Speech Recognition

Add code
Jul 27, 2023
Figure 1 for Say Goodbye to RNN-T Loss: A Novel CIF-based Transducer Architecture for Automatic Speech Recognition
Figure 2 for Say Goodbye to RNN-T Loss: A Novel CIF-based Transducer Architecture for Automatic Speech Recognition
Figure 3 for Say Goodbye to RNN-T Loss: A Novel CIF-based Transducer Architecture for Automatic Speech Recognition
Figure 4 for Say Goodbye to RNN-T Loss: A Novel CIF-based Transducer Architecture for Automatic Speech Recognition
Viaarxiv icon

Dynamic Latency for CTC-Based Streaming Automatic Speech Recognition With Emformer

Add code
Mar 29, 2022
Figure 1 for Dynamic Latency for CTC-Based Streaming Automatic Speech Recognition With Emformer
Figure 2 for Dynamic Latency for CTC-Based Streaming Automatic Speech Recognition With Emformer
Figure 3 for Dynamic Latency for CTC-Based Streaming Automatic Speech Recognition With Emformer
Figure 4 for Dynamic Latency for CTC-Based Streaming Automatic Speech Recognition With Emformer
Viaarxiv icon

Locality Matters: A Locality-Biased Linear Attention for Automatic Speech Recognition

Add code
Mar 29, 2022
Figure 1 for Locality Matters: A Locality-Biased Linear Attention for Automatic Speech Recognition
Figure 2 for Locality Matters: A Locality-Biased Linear Attention for Automatic Speech Recognition
Figure 3 for Locality Matters: A Locality-Biased Linear Attention for Automatic Speech Recognition
Figure 4 for Locality Matters: A Locality-Biased Linear Attention for Automatic Speech Recognition
Viaarxiv icon