Youngwan Lee

MultihopSpatial: Multi-hop Compositional Spatial Reasoning Benchmark for Vision-Language Model

Mar 19, 2026

MA-EgoQA: Question Answering over Egocentric Videos from Multiple Embodied Agents

Mar 11, 2026

HoliSafe: Holistic Safety Benchmarking and Modeling with Safety Meta Token for Vision-Language Model

Jun 05, 2025

VideoICL: Confidence-based Iterative In-context Learning for Out-of-Distribution Video Understanding

Dec 03, 2024

Training-Free Exponential Extension of Sliding Window Context with Cascading KV Cache

Jun 24, 2024

HiP Attention: Sparse Sub-Quadratic Attention with Hierarchical Attention Pruning

Jun 14, 2024

PC-LoRA: Low-Rank Adaptation for Progressive Model Compression with Knowledge Distillation

Jun 13, 2024

Visualizing the loss landscape of Self-supervised Vision Transformer

May 28, 2024

Dynamic and Super-Personalized Media Ecosystem Driven by Generative AI: Unpredictable Plays Never Repeating The Same

Feb 19, 2024

KOALA: Self-Attention Matters in Knowledge Distillation of Latent Diffusion Models for Memory-Efficient and Fast Image Synthesis

Dec 07, 2023