Picture for Yunzhuo Sun

Yunzhuo Sun

Multi-modal Fusion and Query Refinement Network for Video Moment Retrieval and Highlight Detection

Add code
Jan 18, 2025
Figure 1 for Multi-modal Fusion and Query Refinement Network for Video Moment Retrieval and Highlight Detection
Figure 2 for Multi-modal Fusion and Query Refinement Network for Video Moment Retrieval and Highlight Detection
Figure 3 for Multi-modal Fusion and Query Refinement Network for Video Moment Retrieval and Highlight Detection
Figure 4 for Multi-modal Fusion and Query Refinement Network for Video Moment Retrieval and Highlight Detection
Viaarxiv icon

Zero-shot Video Moment Retrieval via Off-the-shelf Multimodal Large Language Models

Add code
Jan 14, 2025
Figure 1 for Zero-shot Video Moment Retrieval via Off-the-shelf Multimodal Large Language Models
Figure 2 for Zero-shot Video Moment Retrieval via Off-the-shelf Multimodal Large Language Models
Figure 3 for Zero-shot Video Moment Retrieval via Off-the-shelf Multimodal Large Language Models
Figure 4 for Zero-shot Video Moment Retrieval via Off-the-shelf Multimodal Large Language Models
Viaarxiv icon

GPTSee: Enhancing Moment Retrieval and Highlight Detection via Description-Based Similarity Features

Add code
Mar 10, 2024
Viaarxiv icon

VTG-GPT: Tuning-Free Zero-Shot Video Temporal Grounding with GPT

Add code
Mar 04, 2024
Viaarxiv icon

MH-DETR: Video Moment and Highlight Detection with Cross-modal Transformer

Add code
Apr 29, 2023
Figure 1 for MH-DETR: Video Moment and Highlight Detection with Cross-modal Transformer
Figure 2 for MH-DETR: Video Moment and Highlight Detection with Cross-modal Transformer
Figure 3 for MH-DETR: Video Moment and Highlight Detection with Cross-modal Transformer
Figure 4 for MH-DETR: Video Moment and Highlight Detection with Cross-modal Transformer
Viaarxiv icon