Lichao Zhang

Accompanied Singing Voice Synthesis with Fully Text-controlled Melody

Jul 02, 2024

Unveiling the Impact of Multi-Modal Interactions on User Engagement: A Comprehensive Evaluation in AI-driven Conversations

Jun 21, 2024

Quality and Quantity: Unveiling a Million High-Quality Images for Text-to-Image Synthesis in Fashion Design

Nov 29, 2023

Efficient Human-AI Coordination via Preparatory Language-based Convention

Nov 01, 2023

Tailored Visions: Enhancing Text-to-Image Generation with Personalized Prompt Rewriting

Oct 12, 2023

DisCover: Disentangled Music Representation Learning for Cover Song Identification

Jul 19, 2023

AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation

May 24, 2023

AlignSTS: Speech-to-Singing Conversion via Cross-Modal Alignment

May 24, 2023

Learning Robust Self-attention Features for Speech Emotion Recognition with Label-adaptive Mixup

May 07, 2023

TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation

May 25, 2022