Kevin J. Shih

VANI: Very-lightweight Accent-controllable TTS for Native and Non-native speakers with Identity Preservation

Mar 14, 2023

Multilingual Multiaccented Multispeaker TTS with RADTTS

Jan 24, 2023

Collecting The Puzzle Pieces: Disentangled Self-Driven Human Pose Transfer by Permuting Textures

Oct 06, 2022

Generative Modeling for Low Dimensional Speech Attributes with Neural Spline Flows

Mar 07, 2022

One TTS Alignment To Rule Them All

Aug 23, 2021

Unsupervised Disentanglement of Pose, Appearance and Background from Images and Videos

Jan 26, 2020

Video Interpolation and Prediction with Unsupervised Landmarks

Sep 06, 2019

Unsupervised Video Interpolation Using Cycle Consistency

Jun 13, 2019

Graphical Contrastive Losses for Scene Graph Generation

Mar 28, 2019

Improving Semantic Segmentation via Video Propagation and Label Relaxation

Dec 04, 2018