Tingtian Li

Unsupervised Modality-Transferable Video Highlight Detection with Representation Activation Sequence Learning

Mar 18, 2024
Via arXiv

CALM: Contrastive Cross-modal Speaking Style Modeling for Expressive Text-to-Speech Synthesis

Aug 30, 2023
Via arXiv

Improved Multiple-Image-Based Reflection Removal Algorithm Using Deep Neural Networks

Aug 10, 2022
Via arXiv

Deep Music Retrieval for Fine-Grained Videos by Exploiting Cross-Modal-Encoded Voice-Overs

Apr 21, 2021
Via arXiv

Group-Skeleton-Based Human Action Recognition in Complex Events

Nov 26, 2020
Via arXiv