Shiliang Zhang

Enhancing Low-Resource ASR through Versatile TTS: Bridging the Data Gap

Oct 22, 2024
Via arXiv

Long-distance Geomagnetic Navigation in GNSS-denied Environments with Deep Reinforcement Learning

Oct 21, 2024
Via arXiv

Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition

Sep 26, 2024
Via arXiv

Are Transformers in Pre-trained LM A Good ASR Encoder? An Empirical Study

Sep 26, 2024
Via arXiv

Exploring SSL Discrete Speech Features for Zipformer-based Contextual ASR

Sep 13, 2024
Via arXiv

Dataset Distillation for Histopathology Image Classification

Aug 19, 2024
Via arXiv

CosyVoice: A Scalable Multilingual Zero-shot Text-to-speech Synthesizer based on Supervised Semantic Tokens

Jul 09, 2024
Via arXiv

Skip-Layer Attention: Bridging Abstract and Detailed Dependencies in Transformers

Jun 17, 2024
Via arXiv

Self-Distillation Prototypes Network: Learning Robust Speaker Representations without Supervision

Jun 17, 2024
Via arXiv

MaLa-ASR: Multimedia-Assisted LLM-Based ASR

Jun 09, 2024
Via arXiv