Susan Liang

VidComposition: Can MLLMs Analyze Compositions in Compiled Videos?

Nov 19, 2024

Scaling Concept With Text-Guided Diffusion Models

Oct 31, 2024

Will the Inclusion of Generated Data Amplify Bias Across Generations in Future Image Classification Models?

Oct 14, 2024

Language-Guided Joint Audio-Visual Editing via One-Shot Adaptation

Oct 09, 2024

Learning to Transform Dynamically for Better Adversarial Transferability

May 23, 2024

Approximated Likelihood Ratio: A Forward-Only and Parallel Framework for Boosting Neural Network Training

Mar 18, 2024

Video Understanding with Large Language Models: A Survey

Jan 04, 2024

Scalable CP Decomposition for Tensor Learning using GPU Tensor Cores

Nov 22, 2023

Neural Acoustic Context Field: Rendering Realistic Room Impulse Response With Neural Fields

Sep 27, 2023

DAVIS: High-Quality Audio-Visual Separation with Generative Diffusion Models

Jul 31, 2023