Picture for Ji Soo Lee

Ji Soo Lee

VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning

Add code
Jan 12, 2025
Viaarxiv icon

Large Language Models are Temporal and Causal Reasoners for Video Question Answering

Add code
Nov 06, 2023
Viaarxiv icon

Open-vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models

Add code
Aug 18, 2023
Viaarxiv icon