Dmitriy Serdyuk

USM RNN-T model weights binarization

Jun 06, 2024

On Robustness to Missing Video for Audiovisual Speech Recognition

Dec 19, 2023

Audio-visual fine-tuning of audio-only ASR models

Dec 14, 2023

Conformers are All You Need for Visual Speech Recognition

Feb 17, 2023

Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition

Jan 25, 2022

Audio-Visual Speech Recognition is Worth 32$\times$32$\times$8 Voxels

Sep 20, 2021

Accounting for Variance in Machine Learning Benchmarks

Mar 01, 2021

Unsupervised adversarial domain adaptation for acoustic scene classification

Aug 17, 2018

Twin Regularization for online speech recognition

Jun 12, 2018

Fortified Networks: Improving the Robustness of Deep Networks by Modeling the Manifold of Hidden Representations

Apr 07, 2018