Picture for Rao Ma

Rao Ma

Scaling and Prompting for Improved End-to-End Spoken Grammatical Error Correction

Add code
May 27, 2025
Figure 1 for Scaling and Prompting for Improved End-to-End Spoken Grammatical Error Correction
Figure 2 for Scaling and Prompting for Improved End-to-End Spoken Grammatical Error Correction
Figure 3 for Scaling and Prompting for Improved End-to-End Spoken Grammatical Error Correction
Figure 4 for Scaling and Prompting for Improved End-to-End Spoken Grammatical Error Correction
Viaarxiv icon

Assessment of L2 Oral Proficiency using Speech Large Language Models

Add code
May 27, 2025
Figure 1 for Assessment of L2 Oral Proficiency using Speech Large Language Models
Figure 2 for Assessment of L2 Oral Proficiency using Speech Large Language Models
Figure 3 for Assessment of L2 Oral Proficiency using Speech Large Language Models
Figure 4 for Assessment of L2 Oral Proficiency using Speech Large Language Models
Viaarxiv icon

Universal Acoustic Adversarial Attacks for Flexible Control of Speech-LLMs

Add code
May 20, 2025
Viaarxiv icon

LegoSLM: Connecting LLM with Speech Encoder using CTC Posteriors

Add code
May 16, 2025
Viaarxiv icon

ASR Error Correction using Large Language Models

Add code
Sep 14, 2024
Figure 1 for ASR Error Correction using Large Language Models
Figure 2 for ASR Error Correction using Large Language Models
Figure 3 for ASR Error Correction using Large Language Models
Figure 4 for ASR Error Correction using Large Language Models
Viaarxiv icon

Learn and Don't Forget: Adding a New Language to ASR Foundation Models

Add code
Jul 09, 2024
Figure 1 for Learn and Don't Forget: Adding a New Language to ASR Foundation Models
Figure 2 for Learn and Don't Forget: Adding a New Language to ASR Foundation Models
Figure 3 for Learn and Don't Forget: Adding a New Language to ASR Foundation Models
Figure 4 for Learn and Don't Forget: Adding a New Language to ASR Foundation Models
Viaarxiv icon

Cross-Lingual Transfer Learning for Speech Translation

Add code
Jul 01, 2024
Figure 1 for Cross-Lingual Transfer Learning for Speech Translation
Figure 2 for Cross-Lingual Transfer Learning for Speech Translation
Figure 3 for Cross-Lingual Transfer Learning for Speech Translation
Figure 4 for Cross-Lingual Transfer Learning for Speech Translation
Viaarxiv icon

Muting Whisper: A Universal Acoustic Adversarial Attack on Speech Foundation Models

Add code
May 09, 2024
Figure 1 for Muting Whisper: A Universal Acoustic Adversarial Attack on Speech Foundation Models
Figure 2 for Muting Whisper: A Universal Acoustic Adversarial Attack on Speech Foundation Models
Figure 3 for Muting Whisper: A Universal Acoustic Adversarial Attack on Speech Foundation Models
Figure 4 for Muting Whisper: A Universal Acoustic Adversarial Attack on Speech Foundation Models
Viaarxiv icon

Investigating the Emergent Audio Classification Ability of ASR Foundation Models

Add code
Nov 15, 2023
Figure 1 for Investigating the Emergent Audio Classification Ability of ASR Foundation Models
Figure 2 for Investigating the Emergent Audio Classification Ability of ASR Foundation Models
Figure 3 for Investigating the Emergent Audio Classification Ability of ASR Foundation Models
Figure 4 for Investigating the Emergent Audio Classification Ability of ASR Foundation Models
Viaarxiv icon

Towards End-to-End Spoken Grammatical Error Correction

Add code
Nov 09, 2023
Viaarxiv icon