Rendong Wang

FocusChat: Text-guided Long Video Understanding via Spatiotemporal Information Filtering

Dec 17, 2024
Via arXiv