Qinghao Han

FrameFusion: Combining Similarity and Importance for Video Token Reduction on Large Visual Language Models

Dec 30, 2024
Via arXiv