Arun Narayanan

Can DeepFake Speech be Reliably Detected?

Oct 09, 2024
Via arXiv

Training Large ASR Encoders with Differential Privacy

Sep 21, 2024
Via arXiv

Efficiently Train ASR Models that Memorize Less and Perform Better with Per-core Clipping

Jun 04, 2024
Via arXiv

Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models

Feb 27, 2024
Via arXiv

A Universally-Deployable ASR Frontend for Joint Acoustic Echo Cancellation, Speech Enhancement, and Voice Separation

Sep 14, 2022
Via arXiv

Streaming Noise Context Aware Enhancement For Automatic Speech Recognition in Multi-Talker Environments

May 17, 2022
Via arXiv

A Conformer-based Waveform-domain Neural Acoustic Echo Canceller Optimized for ASR Accuracy

May 06, 2022
Via arXiv

Mask scalar prediction for improving robust automatic speech recognition

Apr 26, 2022
Via arXiv

Extracting Targeted Training Data from ASR Models, and How to Mitigate It

Apr 18, 2022
Via arXiv

Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition

Apr 13, 2022
Via arXiv