Xiaowei Yi

DisentTalk: Cross-lingual Talking Face Generation via Semantic Disentangled Diffusion Model

Mar 24, 2025
Via arXiv

Vision Transformer Based Video Hashing Retrieval for Tracing the Source of Fake Videos

Dec 15, 2021
Via arXiv

FMFCC-A: A Challenging Mandarin Dataset for Synthetic Speech Detection

Oct 18, 2021
Via arXiv

MediumVC: Any-to-any voice conversion using synthetic specific-speaker speeches as intermedium features

Oct 06, 2021
Via arXiv