Honglie Chen

Unified Speech Recognition: A Single Model for Auditory, Visual, and Audiovisual Inputs

Nov 04, 2024 (via arXiv)

RT-LA-VocE: Real-Time Low-SNR Audio-Visual Speech Enhancement

Jul 10, 2024 (via arXiv)

MSRS: Training Multimodal Speech Recognition Models from Scratch with Sparse Mask Optimization

Jun 25, 2024 (via arXiv)

SparseVSR: Lightweight and Noise Robust Visual Speech Recognition

Jul 10, 2023 (via arXiv)

SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision

Apr 03, 2023 (via arXiv)

Auto-AVSR: Audio-Visual Speech Recognition with Automatic Labels

Mar 25, 2023 (via arXiv)

Audio-Visual Synchronisation in the Wild

Dec 08, 2021 (via arXiv)

Localizing Visual Sounds the Hard Way

Apr 06, 2021 (via arXiv)

VGGSound: A Large-scale Audio-Visual Dataset

Apr 29, 2020 (via arXiv)

AutoCorrect: Deep Inductive Alignment of Noisy Geometric Annotations

Aug 14, 2019 (via arXiv)