Picture for Hank Liao

Hank Liao

Google Inc

DiarizationLM: Speaker Diarization Post-Processing with Large Language Models

Add code
Jan 16, 2024
Viaarxiv icon

On Robustness to Missing Video for Audiovisual Speech Recognition

Add code
Dec 19, 2023
Viaarxiv icon

Towards Word-Level End-to-End Neural Speaker Diarization with Auxiliary Network

Add code
Sep 15, 2023
Viaarxiv icon

USM-SCD: Multilingual Speaker Change Detection Based on Large Pretrained Foundation Models

Add code
Sep 14, 2023
Viaarxiv icon

Conformers are All You Need for Visual Speech Recogntion

Add code
Feb 17, 2023
Viaarxiv icon

End-to-End Multi-Person Audio/Visual Automatic Speech Recognition

Add code
May 11, 2022
Figure 1 for End-to-End Multi-Person Audio/Visual Automatic Speech Recognition
Figure 2 for End-to-End Multi-Person Audio/Visual Automatic Speech Recognition
Figure 3 for End-to-End Multi-Person Audio/Visual Automatic Speech Recognition
Figure 4 for End-to-End Multi-Person Audio/Visual Automatic Speech Recognition
Viaarxiv icon

Recurrent Neural Network Transducer for Audio-Visual Speech Recognition

Add code
Nov 08, 2019
Figure 1 for Recurrent Neural Network Transducer for Audio-Visual Speech Recognition
Figure 2 for Recurrent Neural Network Transducer for Audio-Visual Speech Recognition
Figure 3 for Recurrent Neural Network Transducer for Audio-Visual Speech Recognition
Figure 4 for Recurrent Neural Network Transducer for Audio-Visual Speech Recognition
Viaarxiv icon

A comparison of end-to-end models for long-form speech recognition

Add code
Nov 06, 2019
Figure 1 for A comparison of end-to-end models for long-form speech recognition
Figure 2 for A comparison of end-to-end models for long-form speech recognition
Figure 3 for A comparison of end-to-end models for long-form speech recognition
Viaarxiv icon

Adversarial Training for Multilingual Acoustic Modeling

Add code
Jun 17, 2019
Figure 1 for Adversarial Training for Multilingual Acoustic Modeling
Figure 2 for Adversarial Training for Multilingual Acoustic Modeling
Figure 3 for Adversarial Training for Multilingual Acoustic Modeling
Figure 4 for Adversarial Training for Multilingual Acoustic Modeling
Viaarxiv icon

Neural Language Modeling with Visual Features

Add code
Mar 07, 2019
Figure 1 for Neural Language Modeling with Visual Features
Figure 2 for Neural Language Modeling with Visual Features
Figure 3 for Neural Language Modeling with Visual Features
Figure 4 for Neural Language Modeling with Visual Features
Viaarxiv icon