Anshu Bhatia

Zero-resource Speech Translation and Recognition with LLMs

Dec 24, 2024
Via arXiv

SpeechGuard: Exploring the Adversarial Robustness of Multimodal Large Language Models

May 14, 2024
Via arXiv

Don't Stop Self-Supervision: Accent Adaptation of Speech Representations via Residual Adapters

Jul 02, 2023
Via arXiv

Masked Audio Text Encoders are Effective Multi-Modal Rescorers

May 24, 2023
Via arXiv