Picture for Yongdong Luo

Yongdong Luo

Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension

Add code
Nov 20, 2024
Figure 1 for Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension
Figure 2 for Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension
Figure 3 for Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension
Figure 4 for Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension
Viaarxiv icon

Rethinking 3D Dense Caption and Visual Grounding in A Unified Framework through Prompt-based Localization

Add code
Apr 17, 2024
Viaarxiv icon

A Unified Framework for 3D Point Cloud Visual Grounding

Add code
Aug 23, 2023
Viaarxiv icon