Bingshen Mu

HDMoLE: Mixture of LoRA Experts with Hierarchical Routing and Dynamic Thresholds for Fine-Tuning LLM-based ASR Models

Sep 30, 2024
Via arXiv

MMGER: Multi-modal and Multi-granularity Generative Error Correction with LLM for Joint Accent and Speech Recognition

May 06, 2024
Via arXiv

Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets

May 06, 2024
Via arXiv

E-chat: Emotion-sensitive Spoken Dialogue System with Large Language Models

Jan 06, 2024
Via arXiv

Automatic channel selection and spatial feature integration for multi-channel speech recognition across various array topologies

Dec 15, 2023
Via arXiv

Contextualized End-to-End Speech Recognition with Contextual Phrase Prediction Network

May 21, 2023
Via arXiv

The NPU-ASLP System for Audio-Visual Speech Recognition in MISP 2022 Challenge

Mar 11, 2023
Via arXiv