Picture for Jongseok Kim

Jongseok Kim

TWLV-I: Analysis and Insights from Holistic Evaluation on Video Foundation Models

Add code
Aug 21, 2024
Viaarxiv icon

Cycled Compositional Learning between Images and Text

Add code
Jul 24, 2021
Figure 1 for Cycled Compositional Learning between Images and Text
Figure 2 for Cycled Compositional Learning between Images and Text
Figure 3 for Cycled Compositional Learning between Images and Text
Figure 4 for Cycled Compositional Learning between Images and Text
Viaarxiv icon

A Joint Sequence Fusion Model for Video Question Answering and Retrieval

Add code
Aug 07, 2018
Figure 1 for A Joint Sequence Fusion Model for Video Question Answering and Retrieval
Figure 2 for A Joint Sequence Fusion Model for Video Question Answering and Retrieval
Figure 3 for A Joint Sequence Fusion Model for Video Question Answering and Retrieval
Figure 4 for A Joint Sequence Fusion Model for Video Question Answering and Retrieval
Viaarxiv icon