Picture for Hanghang Ma

Hanghang Ma

Kangaroo: A Powerful Video-Language Model Supporting Long-context Video Input

Add code
Aug 28, 2024
Figure 1 for Kangaroo: A Powerful Video-Language Model Supporting Long-context Video Input
Figure 2 for Kangaroo: A Powerful Video-Language Model Supporting Long-context Video Input
Figure 3 for Kangaroo: A Powerful Video-Language Model Supporting Long-context Video Input
Figure 4 for Kangaroo: A Powerful Video-Language Model Supporting Long-context Video Input
Viaarxiv icon