Shuai Wang

The Hong Kong University of Science and Technology

ExPO: Explainable Phonetic Trait-Oriented Network for Speaker Verification

Jan 14, 2025

Leveraging Metamemory Mechanisms for Enhanced Data-Free Code Generation in LLMs

Jan 14, 2025

Enhancing Large Vision Model in Street Scene Semantic Understanding through Leveraging Posterior Optimization Trajectory

Jan 03, 2025

Clutter Resilient Occlusion Avoidance for Tightly-Coupled Motion-Assisted Detection

Dec 24, 2024

Surrealistic-like Image Generation with Vision-Language Models

Dec 18, 2024

SongEditor: Adapting Zero-Shot Song Generation Language Model as a Multi-Task Editor

Dec 18, 2024

AI PERSONA: Towards Life-long Personalization of LLMs

Dec 17, 2024

Hierarchical Control of Emotion Rendering in Speech Synthesis

Dec 17, 2024

RecSys Arena: Pair-wise Recommender System Evaluation with Large Language Models

Dec 15, 2024

MoMuSE: Momentum Multi-modal Target Speaker Extraction for Real-time Scenarios with Impaired Visual Cues

Dec 11, 2024