Picture for Peiwen Sun

Peiwen Sun

Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation

Add code
Oct 14, 2024
Figure 1 for Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation
Figure 2 for Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation
Figure 3 for Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation
Figure 4 for Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation
Viaarxiv icon

Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model

Add code
Aug 30, 2024
Figure 1 for Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
Figure 2 for Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
Figure 3 for Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
Figure 4 for Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
Viaarxiv icon

Unveiling and Mitigating Bias in Audio Visual Segmentation

Add code
Jul 23, 2024
Figure 1 for Unveiling and Mitigating Bias in Audio Visual Segmentation
Figure 2 for Unveiling and Mitigating Bias in Audio Visual Segmentation
Figure 3 for Unveiling and Mitigating Bias in Audio Visual Segmentation
Figure 4 for Unveiling and Mitigating Bias in Audio Visual Segmentation
Viaarxiv icon

Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation

Add code
Jul 16, 2024
Viaarxiv icon

Can Textual Semantics Mitigate Sounding Object Segmentation Preference?

Add code
Jul 15, 2024
Viaarxiv icon

Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes

Add code
Jul 15, 2024
Viaarxiv icon

FlashSpeech: Efficient Zero-Shot Speech Synthesis

Add code
Apr 25, 2024
Viaarxiv icon

FusionINN: Invertible Image Fusion for Brain Tumor Monitoring

Add code
Apr 02, 2024
Viaarxiv icon

More than Vanilla Fusion: a Simple, Decoupling-free, Attention Module for Multimodal Fusion Based on Signal Theory

Add code
Dec 12, 2023
Viaarxiv icon

Learning Audio-Visual embedding for Wild Person Verification

Add code
Sep 09, 2022
Figure 1 for Learning Audio-Visual embedding for Wild Person Verification
Figure 2 for Learning Audio-Visual embedding for Wild Person Verification
Figure 3 for Learning Audio-Visual embedding for Wild Person Verification
Figure 4 for Learning Audio-Visual embedding for Wild Person Verification
Viaarxiv icon