Picture for Xiaoyu Yang

Xiaoyu Yang

SALMONN-omni: A Codec-free LLM for Full-duplex Speech Understanding and Generation

Add code
Nov 27, 2024
Viaarxiv icon

k2SSL: A Faster and Better Framework for Self-Supervised Speech Representation Learning

Add code
Nov 26, 2024
Viaarxiv icon

Masked Image Contrastive Learning for Efficient Visual Conceptual Pre-training

Add code
Nov 15, 2024
Viaarxiv icon

CR-CTC: Consistency regularization on CTC for improved speech recognition

Add code
Oct 07, 2024
Viaarxiv icon

MT2KD: Towards A General-Purpose Encoder for Speech, Speaker, and Audio Events

Add code
Sep 25, 2024
Figure 1 for MT2KD: Towards A General-Purpose Encoder for Speech, Speaker, and Audio Events
Figure 2 for MT2KD: Towards A General-Purpose Encoder for Speech, Speaker, and Audio Events
Figure 3 for MT2KD: Towards A General-Purpose Encoder for Speech, Speaker, and Audio Events
Figure 4 for MT2KD: Towards A General-Purpose Encoder for Speech, Speaker, and Audio Events
Viaarxiv icon

LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization

Add code
Sep 01, 2024
Figure 1 for LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization
Figure 2 for LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization
Figure 3 for LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization
Figure 4 for LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization
Viaarxiv icon

Interference Management in MIMO-ISAC Systems: A Transceiver Design Approach

Add code
Jul 07, 2024
Figure 1 for Interference Management in MIMO-ISAC Systems: A Transceiver Design Approach
Figure 2 for Interference Management in MIMO-ISAC Systems: A Transceiver Design Approach
Figure 3 for Interference Management in MIMO-ISAC Systems: A Transceiver Design Approach
Figure 4 for Interference Management in MIMO-ISAC Systems: A Transceiver Design Approach
Viaarxiv icon

SUBLLM: A Novel Efficient Architecture with Token Sequence Subsampling for LLM

Add code
Jun 03, 2024
Viaarxiv icon

Adapting Multi-modal Large Language Model to Concept Drift in the Long-tailed Open World

Add code
May 22, 2024
Viaarxiv icon

ViLaM: A Vision-Language Model with Enhanced Visual Grounding and Generalization Capability

Add code
Nov 21, 2023
Viaarxiv icon