Picture for Xiulong Liu

Xiulong Liu

Hearing Anywhere in Any Environment

Add code
Apr 14, 2025
Viaarxiv icon

Enhance Generation Quality of Flow Matching V2A Model via Multi-Step CoT-Like Guidance and Combined Preference Optimization

Add code
Mar 28, 2025
Viaarxiv icon

Building Machine Learning Challenges for Anomaly Detection in Science

Add code
Mar 03, 2025
Viaarxiv icon

Tell What You Hear From What You See -- Video to Audio Generation Through Text

Add code
Nov 08, 2024
Figure 1 for Tell What You Hear From What You See -- Video to Audio Generation Through Text
Figure 2 for Tell What You Hear From What You See -- Video to Audio Generation Through Text
Figure 3 for Tell What You Hear From What You See -- Video to Audio Generation Through Text
Figure 4 for Tell What You Hear From What You See -- Video to Audio Generation Through Text
Viaarxiv icon

CaloChallenge 2022: A Community Challenge for Fast Calorimeter Simulation

Add code
Oct 28, 2024
Figure 1 for CaloChallenge 2022: A Community Challenge for Fast Calorimeter Simulation
Figure 2 for CaloChallenge 2022: A Community Challenge for Fast Calorimeter Simulation
Figure 3 for CaloChallenge 2022: A Community Challenge for Fast Calorimeter Simulation
Figure 4 for CaloChallenge 2022: A Community Challenge for Fast Calorimeter Simulation
Viaarxiv icon

From Vision to Audio and Beyond: A Unified Model for Audio-Visual Representation and Generation

Add code
Sep 27, 2024
Viaarxiv icon

Calo-VQ: Vector-Quantized Two-Stage Generative Model in Calorimeter Simulation

Add code
May 10, 2024
Viaarxiv icon

MuseChat: A Conversational Music Recommendation System for Videos

Add code
Oct 11, 2023
Figure 1 for MuseChat: A Conversational Music Recommendation System for Videos
Figure 2 for MuseChat: A Conversational Music Recommendation System for Videos
Figure 3 for MuseChat: A Conversational Music Recommendation System for Videos
Figure 4 for MuseChat: A Conversational Music Recommendation System for Videos
Viaarxiv icon

Tackling Data Bias in MUSIC-AVQA: Crafting a Balanced Dataset for Unbiased Question-Answering

Add code
Oct 10, 2023
Viaarxiv icon

Active Sparse Conversations for Improved Audio-Visual Embodied Navigation

Add code
Jun 06, 2023
Figure 1 for Active Sparse Conversations for Improved Audio-Visual Embodied Navigation
Figure 2 for Active Sparse Conversations for Improved Audio-Visual Embodied Navigation
Figure 3 for Active Sparse Conversations for Improved Audio-Visual Embodied Navigation
Figure 4 for Active Sparse Conversations for Improved Audio-Visual Embodied Navigation
Viaarxiv icon