Picture for Richard Rose

Richard Rose

Cascaded encoders for fine-tuning ASR models on overlapped speech

Add code
Jun 28, 2023
Viaarxiv icon

End-to-end multi-talker audio-visual ASR using an active speaker attention module

Add code
Apr 01, 2022
Figure 1 for End-to-end multi-talker audio-visual ASR using an active speaker attention module
Figure 2 for End-to-end multi-talker audio-visual ASR using an active speaker attention module
Figure 3 for End-to-end multi-talker audio-visual ASR using an active speaker attention module
Figure 4 for End-to-end multi-talker audio-visual ASR using an active speaker attention module
Viaarxiv icon