Picture for Peng Shen

Peng Shen

Temporal Order Preserved Optimal Transport-based Cross-modal Knowledge Transfer Learning for ASR

Add code
Sep 03, 2024
Viaarxiv icon

Speaker Mask Transformer for Multi-talker Overlapped Speech Recognition

Add code
Dec 18, 2023
Viaarxiv icon

Generative linguistic representation for spoken language identification

Add code
Dec 18, 2023
Viaarxiv icon

Neural domain alignment for spoken language recognition based on optimal transport

Add code
Oct 20, 2023
Viaarxiv icon

Hierarchical Cross-Modality Knowledge Transfer with Sinkhorn Attention for CTC-based ASR

Add code
Sep 28, 2023
Viaarxiv icon

Cross-modal Alignment with Optimal Transport for CTC-based ASR

Add code
Sep 24, 2023
Viaarxiv icon

Pronunciation-aware unique character encoding for RNN Transducer-based Mandarin speech recognition

Add code
Jul 29, 2022
Figure 1 for Pronunciation-aware unique character encoding for RNN Transducer-based Mandarin speech recognition
Figure 2 for Pronunciation-aware unique character encoding for RNN Transducer-based Mandarin speech recognition
Figure 3 for Pronunciation-aware unique character encoding for RNN Transducer-based Mandarin speech recognition
Figure 4 for Pronunciation-aware unique character encoding for RNN Transducer-based Mandarin speech recognition
Viaarxiv icon

Transducer-based language embedding for spoken language identification

Add code
Apr 08, 2022
Figure 1 for Transducer-based language embedding for spoken language identification
Figure 2 for Transducer-based language embedding for spoken language identification
Figure 3 for Transducer-based language embedding for spoken language identification
Viaarxiv icon

Partial Coupling of Optimal Transport for Spoken Language Identification

Add code
Mar 31, 2022
Figure 1 for Partial Coupling of Optimal Transport for Spoken Language Identification
Figure 2 for Partial Coupling of Optimal Transport for Spoken Language Identification
Figure 3 for Partial Coupling of Optimal Transport for Spoken Language Identification
Figure 4 for Partial Coupling of Optimal Transport for Spoken Language Identification
Viaarxiv icon

Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification

Add code
Apr 07, 2021
Figure 1 for Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification
Figure 2 for Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification
Figure 3 for Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification
Figure 4 for Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification
Viaarxiv icon