Jianrong Wang

Emotional Talking Head Generation based on Memory-Sharing and Attention-Augmented Networks

Jun 06, 2023
Via arXiv

MAVD: The First Open Large-Scale Mandarin Audio-Visual Dataset with Depth Information

Jun 04, 2023
Via arXiv

Two-Stream Joint-Training for Speaker Independent Acoustic-to-Articulatory Inversion

Feb 26, 2023
Via arXiv

MVNet: Memory Assistance and Vocal Reinforcement Network for Speech Enhancement

Sep 15, 2022
Via arXiv

Acoustic-to-articulatory Inversion based on Speech Decomposition and Auxiliary Feature

Apr 02, 2022
Via arXiv

Residual-guided Personalized Speech Synthesis based on Face Image

Apr 01, 2022
Via arXiv

Cross-Modal Knowledge Distillation Method for Automatic Cued Speech Recognition

Jun 25, 2021
Via arXiv

Three-Dimensional Lip Motion Network for Text-Independent Speaker Recognition

Oct 13, 2020
Via arXiv

Attention-based Residual Speech Portrait Model for Speech to Face Generation

Jul 09, 2020
Via arXiv

Self-Supervised Joint Learning Framework of Depth Estimation via Implicit Cues

Jun 26, 2020
Via arXiv