Picture for Suha Kwak

Suha Kwak

Structured State-Space Regularization for Compact and Generation-Friendly Image Tokenization

Add code
Apr 13, 2026
Viaarxiv icon

RePL: Pseudo-label Refinement for Semi-supervised LiDAR Semantic Segmentation

Add code
Apr 08, 2026
Viaarxiv icon

Planning in 8 Tokens: A Compact Discrete Tokenizer for Latent World Model

Add code
Mar 05, 2026
Viaarxiv icon

TextME: Bridging Unseen Modalities Through Text Descriptions

Add code
Feb 03, 2026
Viaarxiv icon

Learned split-spectrum metalens for obstruction-free broadband imaging in the visible

Add code
Jan 27, 2026
Viaarxiv icon

VIRO: Robust and Efficient Neuro-Symbolic Reasoning with Verification for Referring Expression Comprehension

Add code
Jan 19, 2026
Viaarxiv icon

Part-Aware Bottom-Up Group Reasoning for Fine-Grained Social Interaction Detection

Add code
Nov 05, 2025
Figure 1 for Part-Aware Bottom-Up Group Reasoning for Fine-Grained Social Interaction Detection
Figure 2 for Part-Aware Bottom-Up Group Reasoning for Fine-Grained Social Interaction Detection
Figure 3 for Part-Aware Bottom-Up Group Reasoning for Fine-Grained Social Interaction Detection
Figure 4 for Part-Aware Bottom-Up Group Reasoning for Fine-Grained Social Interaction Detection
Viaarxiv icon

Improving Sound Source Localization with Joint Slot Attention on Image and Audio

Add code
Apr 21, 2025
Figure 1 for Improving Sound Source Localization with Joint Slot Attention on Image and Audio
Figure 2 for Improving Sound Source Localization with Joint Slot Attention on Image and Audio
Figure 3 for Improving Sound Source Localization with Joint Slot Attention on Image and Audio
Figure 4 for Improving Sound Source Localization with Joint Slot Attention on Image and Audio
Viaarxiv icon

DiCoTTA: Domain-invariant Learning for Continual Test-time Adaptation

Add code
Apr 07, 2025
Figure 1 for DiCoTTA: Domain-invariant Learning for Continual Test-time Adaptation
Figure 2 for DiCoTTA: Domain-invariant Learning for Continual Test-time Adaptation
Figure 3 for DiCoTTA: Domain-invariant Learning for Continual Test-time Adaptation
Figure 4 for DiCoTTA: Domain-invariant Learning for Continual Test-time Adaptation
Viaarxiv icon

Learning Audio-guided Video Representation with Gated Attention for Video-Text Retrieval

Add code
Apr 03, 2025
Viaarxiv icon