Eunhwan Park

MERLIN: Multimodal Embedding Refinement via LLM-based Iterative Navigation for Text-Video Retrieval-Rerank Pipeline

Jul 17, 2024
Via arXiv

Unleash the Potential of CLIP for Video Highlight Detection

Apr 02, 2024
Via arXiv