Xiaoyu Yang

Masked Image Contrastive Learning for Efficient Visual Conceptual Pre-training

Nov 15, 2024 (via arXiv)

CR-CTC: Consistency regularization on CTC for improved speech recognition

Oct 07, 2024 (via arXiv)

MT2KD: Towards A General-Purpose Encoder for Speech, Speaker, and Audio Events

Sep 25, 2024 (via arXiv)

LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization

Sep 01, 2024 (via arXiv)

Interference Management in MIMO-ISAC Systems: A Transceiver Design Approach

Jul 07, 2024 (via arXiv)

SUBLLM: A Novel Efficient Architecture with Token Sequence Subsampling for LLM

Jun 03, 2024 (via arXiv)

Adapting Multi-modal Large Language Model to Concept Drift in the Long-tailed Open World

May 22, 2024 (via arXiv)

ViLaM: A Vision-Language Model with Enhanced Visual Grounding and Generalization Capability

Nov 21, 2023 (via arXiv)

Zipformer: A faster and better encoder for automatic speech recognition

Oct 17, 2023 (via arXiv)

Mutual Information Metrics for Uplink MIMO-OFDM Integrated Sensing and Communication System

Oct 10, 2023 (via arXiv)