Siting Xu

Video Understanding with Large Language Models: A Survey

Jan 04, 2024
Via arXiv

LaunchpadGPT: Language Model as Music Visualization Designer on Launchpad

Jul 23, 2023
Via arXiv

Multi-modal Segment Assemblage Network for Ad Video Editing with Importance-Coherence Reward

Sep 25, 2022
Via arXiv