Picture for Yuke Li

Yuke Li

CoDiff-VC: A Codec-Assisted Diffusion Model for Zero-shot Voice Conversion

Add code
Dec 03, 2024
Viaarxiv icon

Cross-Modal Few-Shot Learning: a Generative Transfer Learning Framework

Add code
Oct 14, 2024
Figure 1 for Cross-Modal Few-Shot Learning: a Generative Transfer Learning Framework
Figure 2 for Cross-Modal Few-Shot Learning: a Generative Transfer Learning Framework
Figure 3 for Cross-Modal Few-Shot Learning: a Generative Transfer Learning Framework
Figure 4 for Cross-Modal Few-Shot Learning: a Generative Transfer Learning Framework
Viaarxiv icon

Split to Merge: Unifying Separated Modalities for Unsupervised Domain Adaptation

Add code
Mar 11, 2024
Figure 1 for Split to Merge: Unifying Separated Modalities for Unsupervised Domain Adaptation
Figure 2 for Split to Merge: Unifying Separated Modalities for Unsupervised Domain Adaptation
Figure 3 for Split to Merge: Unifying Separated Modalities for Unsupervised Domain Adaptation
Figure 4 for Split to Merge: Unifying Separated Modalities for Unsupervised Domain Adaptation
Viaarxiv icon

Learning Domain-Invariant Temporal Dynamics for Few-Shot Action Recognition

Add code
Feb 20, 2024
Figure 1 for Learning Domain-Invariant Temporal Dynamics for Few-Shot Action Recognition
Figure 2 for Learning Domain-Invariant Temporal Dynamics for Few-Shot Action Recognition
Figure 3 for Learning Domain-Invariant Temporal Dynamics for Few-Shot Action Recognition
Figure 4 for Learning Domain-Invariant Temporal Dynamics for Few-Shot Action Recognition
Viaarxiv icon

HaltingVT: Adaptive Token Halting Transformer for Efficient Video Recognition

Add code
Jan 10, 2024
Figure 1 for HaltingVT: Adaptive Token Halting Transformer for Efficient Video Recognition
Figure 2 for HaltingVT: Adaptive Token Halting Transformer for Efficient Video Recognition
Figure 3 for HaltingVT: Adaptive Token Halting Transformer for Efficient Video Recognition
Figure 4 for HaltingVT: Adaptive Token Halting Transformer for Efficient Video Recognition
Viaarxiv icon

Learning Socio-Temporal Graphs for Multi-Agent Trajectory Prediction

Add code
Dec 22, 2023
Figure 1 for Learning Socio-Temporal Graphs for Multi-Agent Trajectory Prediction
Figure 2 for Learning Socio-Temporal Graphs for Multi-Agent Trajectory Prediction
Figure 3 for Learning Socio-Temporal Graphs for Multi-Agent Trajectory Prediction
Figure 4 for Learning Socio-Temporal Graphs for Multi-Agent Trajectory Prediction
Viaarxiv icon

Enhancing Traffic Object Detection in Variable Illumination with RGB-Event Fusion

Add code
Nov 01, 2023
Viaarxiv icon

Multi-Speaker Expressive Speech Synthesis via Semi-supervised Contrastive Learning

Add code
Oct 26, 2023
Figure 1 for Multi-Speaker Expressive Speech Synthesis via Semi-supervised Contrastive Learning
Figure 2 for Multi-Speaker Expressive Speech Synthesis via Semi-supervised Contrastive Learning
Figure 3 for Multi-Speaker Expressive Speech Synthesis via Semi-supervised Contrastive Learning
Figure 4 for Multi-Speaker Expressive Speech Synthesis via Semi-supervised Contrastive Learning
Viaarxiv icon

LAE-ST-MoE: Boosted Language-Aware Encoder Using Speech Translation Auxiliary Task for E2E Code-switching ASR

Add code
Oct 07, 2023
Figure 1 for LAE-ST-MoE: Boosted Language-Aware Encoder Using Speech Translation Auxiliary Task for E2E Code-switching ASR
Figure 2 for LAE-ST-MoE: Boosted Language-Aware Encoder Using Speech Translation Auxiliary Task for E2E Code-switching ASR
Figure 3 for LAE-ST-MoE: Boosted Language-Aware Encoder Using Speech Translation Auxiliary Task for E2E Code-switching ASR
Figure 4 for LAE-ST-MoE: Boosted Language-Aware Encoder Using Speech Translation Auxiliary Task for E2E Code-switching ASR
Viaarxiv icon

Zero-Shot Emotion Transfer For Cross-Lingual Speech Synthesis

Add code
Oct 06, 2023
Figure 1 for Zero-Shot Emotion Transfer For Cross-Lingual Speech Synthesis
Figure 2 for Zero-Shot Emotion Transfer For Cross-Lingual Speech Synthesis
Figure 3 for Zero-Shot Emotion Transfer For Cross-Lingual Speech Synthesis
Figure 4 for Zero-Shot Emotion Transfer For Cross-Lingual Speech Synthesis
Viaarxiv icon