Picture for Benxiang Zhai

Benxiang Zhai

Multi-modal Fusion and Query Refinement Network for Video Moment Retrieval and Highlight Detection

Add code
Jan 18, 2025
Figure 1 for Multi-modal Fusion and Query Refinement Network for Video Moment Retrieval and Highlight Detection
Figure 2 for Multi-modal Fusion and Query Refinement Network for Video Moment Retrieval and Highlight Detection
Figure 3 for Multi-modal Fusion and Query Refinement Network for Video Moment Retrieval and Highlight Detection
Figure 4 for Multi-modal Fusion and Query Refinement Network for Video Moment Retrieval and Highlight Detection
Viaarxiv icon

Zero-shot Video Moment Retrieval via Off-the-shelf Multimodal Large Language Models

Add code
Jan 14, 2025
Figure 1 for Zero-shot Video Moment Retrieval via Off-the-shelf Multimodal Large Language Models
Figure 2 for Zero-shot Video Moment Retrieval via Off-the-shelf Multimodal Large Language Models
Figure 3 for Zero-shot Video Moment Retrieval via Off-the-shelf Multimodal Large Language Models
Figure 4 for Zero-shot Video Moment Retrieval via Off-the-shelf Multimodal Large Language Models
Viaarxiv icon

VTG-GPT: Tuning-Free Zero-Shot Video Temporal Grounding with GPT

Add code
Mar 04, 2024
Viaarxiv icon