Picture for Mingkun Huang

Mingkun Huang

NEST-RQ: Next Token Prediction for Speech Self-Supervised Pre-Training

Add code
Sep 13, 2024
Figure 1 for NEST-RQ: Next Token Prediction for Speech Self-Supervised Pre-Training
Figure 2 for NEST-RQ: Next Token Prediction for Speech Self-Supervised Pre-Training
Figure 3 for NEST-RQ: Next Token Prediction for Speech Self-Supervised Pre-Training
Figure 4 for NEST-RQ: Next Token Prediction for Speech Self-Supervised Pre-Training
Viaarxiv icon

Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition

Add code
Jul 05, 2024
Viaarxiv icon

Language Adaptive Cross-lingual Speech Representation Learning with Sparse Sharing Sub-networks

Add code
Mar 09, 2022
Figure 1 for Language Adaptive Cross-lingual Speech Representation Learning with Sparse Sharing Sub-networks
Figure 2 for Language Adaptive Cross-lingual Speech Representation Learning with Sparse Sharing Sub-networks
Figure 3 for Language Adaptive Cross-lingual Speech Representation Learning with Sparse Sharing Sub-networks
Figure 4 for Language Adaptive Cross-lingual Speech Representation Learning with Sparse Sharing Sub-networks
Viaarxiv icon