Picture for Minjoon Jung

Minjoon Jung

On the Consistency of Video Large Language Models in Temporal Comprehension

Add code
Nov 20, 2024
Viaarxiv icon

PGA: Personalizing Grasping Agents with Single Human-Robot Interaction

Add code
Oct 19, 2023
Figure 1 for PGA: Personalizing Grasping Agents with Single Human-Robot Interaction
Figure 2 for PGA: Personalizing Grasping Agents with Single Human-Robot Interaction
Figure 3 for PGA: Personalizing Grasping Agents with Single Human-Robot Interaction
Figure 4 for PGA: Personalizing Grasping Agents with Single Human-Robot Interaction
Viaarxiv icon

Overcoming Weak Visual-Textual Alignment for Video Moment Retrieval

Add code
Jun 05, 2023
Figure 1 for Overcoming Weak Visual-Textual Alignment for Video Moment Retrieval
Viaarxiv icon

Modal-specific Pseudo Query Generation for Video Corpus Moment Retrieval

Add code
Oct 23, 2022
Viaarxiv icon

Toward a Human-Level Video Understanding Intelligence

Add code
Oct 18, 2021
Figure 1 for Toward a Human-Level Video Understanding Intelligence
Figure 2 for Toward a Human-Level Video Understanding Intelligence
Figure 3 for Toward a Human-Level Video Understanding Intelligence
Viaarxiv icon