Picture for Shangzhe Di

Shangzhe Di

Unlocking Video-LLM via Agent-of-Thoughts Distillation

Add code
Dec 02, 2024
Viaarxiv icon

Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos

Add code
Aug 26, 2024
Figure 1 for Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos
Figure 2 for Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos
Figure 3 for Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos
Figure 4 for Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos
Viaarxiv icon

Grounded Question-Answering in Long Egocentric Videos

Add code
Dec 11, 2023
Viaarxiv icon

Sparse Dense Fusion for 3D Object Detection

Add code
Apr 09, 2023
Viaarxiv icon

Video Background Music Generation with Controllable Music Transformer

Add code
Nov 16, 2021
Figure 1 for Video Background Music Generation with Controllable Music Transformer
Figure 2 for Video Background Music Generation with Controllable Music Transformer
Figure 3 for Video Background Music Generation with Controllable Music Transformer
Figure 4 for Video Background Music Generation with Controllable Music Transformer
Viaarxiv icon