Picture for Xuhong Xiao

Xuhong Xiao

Learning Video Context as Interleaved Multimodal Sequences

Add code
Jul 31, 2024
Viaarxiv icon