Jianguo Wei

Whisper-MLA: Reducing GPU Memory Consumption of ASR Models based on MHA2MLA Conversion

Feb 28, 2026
Via arXiv

SoulX-Singer: Towards High-Quality Zero-Shot Singing Voice Synthesis

Feb 08, 2026
Via arXiv

Listening for "You": Enhancing Speech Image Retrieval via Target Speaker Extraction

Sep 11, 2025
Via arXiv

Detect an Object At Once without Fine-tuning

Nov 04, 2024
Via arXiv

Synergistic Dual Spatial-aware Generation of Image-to-Text and Text-to-Image

Oct 20, 2024
Via arXiv

You Only Speak Once to See

Sep 27, 2024
Via arXiv

Integrated Multi-Level Knowledge Distillation for Enhanced Speaker Verification

Sep 14, 2024
Via arXiv

Channel Adaptation for Speaker Verification Using Optimal Transport with Pseudo Label

Sep 14, 2024
Via arXiv

Robust Channel Learning for Large-Scale Radio Speaker Verification

Jun 16, 2024
Via arXiv

Neighborhood Attention Transformer with Progressive Channel Fusion for Speaker Verification

May 20, 2024
Via arXiv