Eunhwan Park

MERLIN: Multimodal Embedding Refinement via LLM-based Iterative Navigation for Text-Video Retrieval-Rerank Pipeline

Jul 17, 2024

Unleash the Potential of CLIP for Video Highlight Detection

Apr 02, 2024