Picture for Arun Narayanan

Arun Narayanan

Can DeepFake Speech be Reliably Detected?

Add code
Oct 09, 2024
Figure 1 for Can DeepFake Speech be Reliably Detected?
Figure 2 for Can DeepFake Speech be Reliably Detected?
Figure 3 for Can DeepFake Speech be Reliably Detected?
Figure 4 for Can DeepFake Speech be Reliably Detected?
Viaarxiv icon

Training Large ASR Encoders with Differential Privacy

Add code
Sep 21, 2024
Figure 1 for Training Large ASR Encoders with Differential Privacy
Figure 2 for Training Large ASR Encoders with Differential Privacy
Figure 3 for Training Large ASR Encoders with Differential Privacy
Figure 4 for Training Large ASR Encoders with Differential Privacy
Viaarxiv icon

Efficiently Train ASR Models that Memorize Less and Perform Better with Per-core Clipping

Add code
Jun 04, 2024
Viaarxiv icon

Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models

Add code
Feb 27, 2024
Viaarxiv icon

A Universally-Deployable ASR Frontend for Joint Acoustic Echo Cancellation, Speech Enhancement, and Voice Separation

Add code
Sep 14, 2022
Figure 1 for A Universally-Deployable ASR Frontend for Joint Acoustic Echo Cancellation, Speech Enhancement, and Voice Separation
Figure 2 for A Universally-Deployable ASR Frontend for Joint Acoustic Echo Cancellation, Speech Enhancement, and Voice Separation
Figure 3 for A Universally-Deployable ASR Frontend for Joint Acoustic Echo Cancellation, Speech Enhancement, and Voice Separation
Figure 4 for A Universally-Deployable ASR Frontend for Joint Acoustic Echo Cancellation, Speech Enhancement, and Voice Separation
Viaarxiv icon

Streaming Noise Context Aware Enhancement For Automatic Speech Recognition in Multi-Talker Environments

Add code
May 17, 2022
Figure 1 for Streaming Noise Context Aware Enhancement For Automatic Speech Recognition in Multi-Talker Environments
Figure 2 for Streaming Noise Context Aware Enhancement For Automatic Speech Recognition in Multi-Talker Environments
Figure 3 for Streaming Noise Context Aware Enhancement For Automatic Speech Recognition in Multi-Talker Environments
Figure 4 for Streaming Noise Context Aware Enhancement For Automatic Speech Recognition in Multi-Talker Environments
Viaarxiv icon

A Conformer-based Waveform-domain Neural Acoustic Echo Canceller Optimized for ASR Accuracy

Add code
May 06, 2022
Figure 1 for A Conformer-based Waveform-domain Neural Acoustic Echo Canceller Optimized for ASR Accuracy
Figure 2 for A Conformer-based Waveform-domain Neural Acoustic Echo Canceller Optimized for ASR Accuracy
Figure 3 for A Conformer-based Waveform-domain Neural Acoustic Echo Canceller Optimized for ASR Accuracy
Figure 4 for A Conformer-based Waveform-domain Neural Acoustic Echo Canceller Optimized for ASR Accuracy
Viaarxiv icon

Mask scalar prediction for improving robust automatic speech recognition

Add code
Apr 26, 2022
Figure 1 for Mask scalar prediction for improving robust automatic speech recognition
Figure 2 for Mask scalar prediction for improving robust automatic speech recognition
Figure 3 for Mask scalar prediction for improving robust automatic speech recognition
Figure 4 for Mask scalar prediction for improving robust automatic speech recognition
Viaarxiv icon

Extracting Targeted Training Data from ASR Models, and How to Mitigate It

Add code
Apr 18, 2022
Figure 1 for Extracting Targeted Training Data from ASR Models, and How to Mitigate It
Figure 2 for Extracting Targeted Training Data from ASR Models, and How to Mitigate It
Figure 3 for Extracting Targeted Training Data from ASR Models, and How to Mitigate It
Figure 4 for Extracting Targeted Training Data from ASR Models, and How to Mitigate It
Viaarxiv icon

Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition

Add code
Apr 13, 2022
Figure 1 for Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition
Figure 2 for Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition
Figure 3 for Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition
Figure 4 for Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition
Viaarxiv icon