Picture for Hyunjun Kim

Hyunjun Kim

Look Every Frame All at Once: Video-Ma$^2$mba for Efficient Long-form Video Understanding with Multi-Axis Gradient Checkpointing

Add code
Nov 29, 2024
Viaarxiv icon

SALOVA: Segment-Augmented Long Video Assistant for Targeted Retrieval and Routing in Long-Form Video Analysis

Add code
Nov 25, 2024
Figure 1 for SALOVA: Segment-Augmented Long Video Assistant for Targeted Retrieval and Routing in Long-Form Video Analysis
Figure 2 for SALOVA: Segment-Augmented Long Video Assistant for Targeted Retrieval and Routing in Long-Form Video Analysis
Figure 3 for SALOVA: Segment-Augmented Long Video Assistant for Targeted Retrieval and Routing in Long-Form Video Analysis
Figure 4 for SALOVA: Segment-Augmented Long Video Assistant for Targeted Retrieval and Routing in Long-Form Video Analysis
Viaarxiv icon

Personalized Lip Reading: Adapting to Your Unique Lip Movements with Vision and Language

Add code
Sep 02, 2024
Viaarxiv icon

CODE: Contrasting Self-generated Description to Combat Hallucination in Large Multi-modal Models

Add code
Jun 04, 2024
Viaarxiv icon

Robust Pedestrian Detection via Constructing Versatile Pedestrian Knowledge Bank

Add code
Apr 30, 2024
Figure 1 for Robust Pedestrian Detection via Constructing Versatile Pedestrian Knowledge Bank
Figure 2 for Robust Pedestrian Detection via Constructing Versatile Pedestrian Knowledge Bank
Figure 3 for Robust Pedestrian Detection via Constructing Versatile Pedestrian Knowledge Bank
Figure 4 for Robust Pedestrian Detection via Constructing Versatile Pedestrian Knowledge Bank
Viaarxiv icon

On the Consideration of AI Openness: Can Good Intent Be Abused?

Add code
Mar 11, 2024
Figure 1 for On the Consideration of AI Openness: Can Good Intent Be Abused?
Figure 2 for On the Consideration of AI Openness: Can Good Intent Be Abused?
Figure 3 for On the Consideration of AI Openness: Can Good Intent Be Abused?
Figure 4 for On the Consideration of AI Openness: Can Good Intent Be Abused?
Viaarxiv icon

Incorporating Language-Driven Appearance Knowledge Units with Visual Cues in Pedestrian Detection

Add code
Nov 02, 2023
Viaarxiv icon

Speaker-adaptive Lip Reading with User-dependent Padding

Add code
Aug 09, 2022
Figure 1 for Speaker-adaptive Lip Reading with User-dependent Padding
Figure 2 for Speaker-adaptive Lip Reading with User-dependent Padding
Figure 3 for Speaker-adaptive Lip Reading with User-dependent Padding
Figure 4 for Speaker-adaptive Lip Reading with User-dependent Padding
Viaarxiv icon

Fast Monte-Carlo Approximation of the Attention Mechanism

Add code
Jan 30, 2022
Figure 1 for Fast Monte-Carlo Approximation of the Attention Mechanism
Figure 2 for Fast Monte-Carlo Approximation of the Attention Mechanism
Figure 3 for Fast Monte-Carlo Approximation of the Attention Mechanism
Figure 4 for Fast Monte-Carlo Approximation of the Attention Mechanism
Viaarxiv icon